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INTRODUCTION 

Smoking-related  cancers  such  as  lung  and  head  and  neck  cancers  are  a  major  cause  of  cancer 
death  in  the  United  States.  About  25%  of  lung  cancer  patients  are  diagnosed  with  stage  I  or  II 
disease  and  undergo  surgery  with  curative  intent,  but  the  5-year  survival  for  this  group  of 
patients  is  only  30%-70%.  Patients  with  a  strong  history  of  smoking  and  prior  early-stage  cancer 
are  found  to  be  at  high  risk  for  cancer  recurrence  or  development  of  second  primary  tumors 
(SPTs).  An  effective  adjuvant  therapy  after  surgery  in  this  group  of  patients  is  not  well 
established  yet.  The  survival  benefit  of  adjuvant  chemotherapy  was  uncertain  until  recent 
findings  reported  by  Winton  and  colleagues  (Winton  et  al.,  2005).  They  found  that  adjuvant 
chemotherapy  (vinorelbine  and  cisplatin)  increases  the  5-year  survival  of  surgically  resected 
non-small  cell  lung  cancer  (NSCLC)  patients,  resolving  the  debate  over  the  benefit  of  adjuvant 
chemotherapy.  Thus,  better-designed  clinical  trials  and  basic  research  are  needed  to  establish 
the  standard  of  care  for  these  patients  after  surgery. 

The  program  VITAL  (Vanguard  Trial  of  Investigational  Therapeutics  in  Adjuvant  Treatment  of 
Lung  Cancer)  initiated  in  2003  was  developed  to  gain  a  better  understanding  of  the  molecular 
events  underlying  the  progression  of  NSCLC  in  order  to  develop  a  risk  model  for  cancer 
recurrence  and  development  of  smoking-related  SPT  in  the  high-risk  population,  and  to  identify 
effective  preventive  agents  for  this  group  of  patients.  Specifically,  our  objectives  are: 

•  To  identify  biologically-based  treatments  for  prevention  of  cancer  recurrence  and 
development  of  second  primary  tumors  in  high-risk  patients; 

•  To  understand  molecular  events  in  premalignant  tissues  that  contribute  to  progression  or 
malignancy; 

•  To  develop  a  risk  prediction  model  for  disease  recurrence  and  development  of  second 
primary  tumors  in  high-risk  patients  by  combining  clinical  treatment  outcomes  with  molecular 
and  imaging  data. 

Three  clinical  trials  were  proposed,  in  part,  to  acquire  the  necessary  correlative  samples  to 
develop  this  risk  model,  which  will  significantly  improve  decision-making  for  patients  and 
physicians  in  the  management  of  this  challenging  disease.  Histologic  assessment  was  planned 
to  determine  whether  malignant  changes  would  occur  during  this  time  period.  Despite 
substantial  efforts,  our  patient  accrual  was  significantly  lower  than  expected  due  to  a  number  of 
factors;  thus,  a  ReVITALization  plan  was  proposed  (see  revised  Aims  below)  and  approved  by 
the  DoD  in  the  previous  funding  period.  Implementation  of  the  alternative  ReVITALization 
strategy  over  the  past  year  was  based  on  the  revised  project  aims  that  were  developed  to 
accomplish  our  goal  of  the  development  of  the  risk  model.  An  overview  of  the  changes  is 
provided  below  with  additional  details  in  each  relevant  project.  It  should  be  noted  that  work 
described  in  Project  1  related  to  these  reviewed  aims  is  planned  to  continue  until  January  2010; 
a  request  for  an  unfunded  12-month  extension  has  been  submitted  for  approval  to  the  DoD. 

ReVITALization  Aims: 

1.  Circumvent  low  accruals  using  surgical  specimens  in  our  tissue  bank.  These 
specimens  (about  500  samples)  of  resected  lung  cancer  will  be  utilized  for  biomarker 
assessment  and  will  serve  as  the  foundation  for  a  biomarker-based  risk  assessment 
model. 

2.  Continue  enrolling  patients  in  our  Vanguard  trial  to  accrue  50-60  patients.  This 
cohort  will  provide  sufficient  biospecimens  for  the  aims  proposed  in  the  other  projects  of 
the  VITAL  program.  Additionally,  the  clinical  data  obtained  from  these  patients  will  be 
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used  to  test  the  biomarker-based  risk  assessment  model  and  the  follow-up 
bronchoscopy  specimens  will  provide  important  information  for  biomarker  changes  in  the 
bronchial  epithelium. 

3.  Close  the  celecoxib  and  erlotinib  trials  to  focus  resources  on  specimen  analyses 
to  develop  the  biomarker  risk  model. 

4.  Perform  two  additional  discovery  projects  related  to  increased  risk  that  are  only 
now  possible  due  to  continued  progress  in  VITAL. 

a.  Identify  gene  expression  signatures  in  bronchial  brush  specimens  using  high- 
throughput  genomics  approach. 

b.  Identify  genes  expression  signatures  in  epithelial  cells  detected  by  LIFE 
bronchoscopy  that  determine  aggressiveness. 

This  report  summarizes  work  conducted  over  the  past  year  of  the  research  period,  highlights 
key  research  accomplishments  and  reportable  outcomes  with  the  bibliography  of  all  publications 
and  meeting  abstracts  derived  from  VITAL  during  this  timeframe. 

PROGRESS  REPORT  (BODY) 

Project  1 :  Biologic  Approaches  for  Adjuvant  Treatment  of  Aerodigestive  Tract  Cancer 

(PI  and  co-PIs:  Drs.  Waun  Ki  Hong,  Edward  S.  Kim,  Rodolfo  C.  Morice,  David  J.  Stewart) 

Aim  1  Assess  the  smoking-related  disease-free  survival  in  patients  who  are  current  or 
former  smokers  with  a  prior  definitively-treated  stage  I/ll  lung  or  head  and  neck 
cancer. 

The  main  objective  for  this  project  was  to  open  the  Vanguard  study  at  MDACC  as  well  as  the  2 
other  participating  sites.  Enrollment  was  planned  for  a  total  of  300  patients  with  definitively 
treated  stage  I/ll  lung  or  head  and  neck  cancer  and  at  least  a  20-pack-year  smoking  history. 
Patients  undergo  baseline  testing  including  chest  x-ray,  CT  scan,  labs,  bronchoscopy,  and  other 
specimen  collections  (i.e.,  sputum,  saliva,  serologies).  Bronchoscopies  and  specimen  collection 
are  performed  at  baseline  and  at  months  12,  24,  and  36.  White-light  alone  or  white-light  and 
autofluorescence  modalities  are  used.  Abnormal  areas  detected  by  bronchoscopy  are  biopsied. 
Histologic  assessment  is  performed  to  determine  whether  malignant  changes  will  occur  during 
the  time  period.  If  severe  dysplasia,  carcinoma  in  situ,  or  carcinoma  is  discovered,  patients 
follow  the  plans  outlined  in  the  clinical  protocol.  Once  patients  have  completed  3  years  of 
testing,  they  are  followed  until  the  study  is  completed.  As  per  the  revised  ReVITALization  aims, 
the  study  will  be  closed  when  a  total  of  50-60  patients  have  been  accrued;  all  patients  will  be 
followed  as  outlined  above. 

Summary  of  Research  Findings 

In  the  past  year,  we  have  continued  enrolling  patients  in  the  VITAL/Vanguard  trials.  We  have 
accomplished  our  goal  for  enrollment  (at  least  50  patients)  and  plan  to  close  the  study  to  new 
patients  after  January  2009.  A  total  of  53  patients  have  been  enrolled  in  the  Vanguard  trial,  with 
two  additional  patients  to  be  enrolled  by  the  end  of  February  2009  (55  total  patients).  Currently, 
31  patients  have  completed  both  the  baseline  and  12-month  bronchoscopy.  Nine  additional 
bronchoscopy  procedures  are  scheduled  to  be  completed  by  January  2010,  which  will  bring  our 
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number  of  evaluable  patients  to  at  least  40  by  the  end  of  the  next  year.  Patient  clinical  data  and 
tissues  have  been  and  continue  to  be  collected  and  will  be  distributed  to  investigators  of  VITAL 
research  projects  through  the  VITAL  Pathology  Core.  Over  the  next  12  months,  the  Pathology 
Core  will  complete  the  immunohistochemistry  (IHC)  analysis  of  the  tissue  microarray  (TMA)  set 
for  Project  1.  For  details,  please  refer  to  the  Pathology  Core  update  (pg.  49).  As  detailed  in  the 
report  from  the  Biostatistical  Core  (see  pg.  47),  this  sample  size  will  allow  us  to  update  the 
ReVITALization  database  with  patient  data  for  analysis  to  support  this  collective  research  effort. 
New  statistical  methods  are  being  used  to  evaluate  the  interactions  for  combination  therapy  to 
determine  if  the  effects  are  synergistic,  additive,  or  antagonistic. 

Plan:  Close  the  Vanguard  trial  to  accrual. 

Following  completion  of  accrual  in  January  2009  and  subsequent  patient  follow-up,  we  will  have 
the  necessary  prospective  specimens  to  fulfill  the  goals  of  Projects  2-5.  As  noted  in  the  revised 
aims,  the  planned  biomarker  analyses  in  these  projects  will  be  supplemented  using 
retrospective  specimens  to  maximize  data  acquisition  while  minimizing  the  time  required  to 
develop  the  proposed  risk  model  (See  Aim  3  and  the  Core  C  report  below). 

Aim  2  Evaluate  effects  of  biologic  agents  as  adjuvant  therapy  on  the  modulation  of 
histology  and  specific  biomarkers  in  this  high-risk  population. 

Current  adjuvant  chemotherapy  offers  some  benefits  in  the  high-risk  patients,  but  is  not  a  long¬ 
term  preventive  strategy.  Our  plan  was  to  open  several  biologic  adjuvant  clinical  trials  with 
novel  agents  such  as  celecoxib,  erlotinib,  lonafarnib,  and  possibly  others;  however,  poor  accrual 
due  to  changes  in  the  standard  of  care  for  lung  cancer  patients  prohibited  successful  completion 
of  the  proposed  trials  and,  thus,  the  trials  were  abandoned  as  previously  described.  Our  revised 
aim  is  now  focused  on  the  timely  development  of  the  risk  model  (see  Aim  3  /  Revised  Task  3). 
The  retrospective  specimens  will  be  used  for  the  proposed  analyses  and  correlated  with  the 
available  clinical  data. 

Summary  of  Research  Findings 

As  noted  previously,  the  celecoxib  trial  was  closed,  and  the  proposed  erlotinib  trial  was 
deferred.  We  will  continue  to  focus  our  efforts  on  the  productive  analysis  of  acquired  samples 
from  the  clinical  trial  and  from  our  tissue  bank,  leading  to  the  timely  development  of  a  lung 
cancer  risk  model. 

Aim  3  Develop  a  lung  cancer  risk  model  to  help  predict  the  likelihood  of  development  of 
relapse  or  new  smoking  related  primary  tumors 

Patients  with  a  history  of  smoking  and  a  prior  surgically  resected  stage  I/ll  head  and  neck  or 
lung  cancer  are  at  high  risk  for  cancer  recurrence  or  SPTs.  There  are  no  standard  interventions 
that  have  been  proven  to  help  reduce  the  risk  of  cancer  occurrence.  A  Gail  risk  model 
implemented  in  the  initial  management  of  breast  cancer  screening  has  proven  useful  and  has 
helped  with  early  detection  and  more  stringent  follow-up  in  the  higher  risk  cohorts.  Patients 
enrolled  in  the  Vanguard  trial  will  have  aggressive  post-operative  follow-up  with  analysis 
including  frequent  serologies,  bronchial  specimens  and  CT  scanning.  Trends  in  these  multiple 
biomarkers  will  be  analyzed  and  used  to  develop  a  predictive  model.  Establishing  a  risk  model 
will  eventually  help  identify  patients  who  may  be  at  higher  risk  for  lung  cancer  development  and 
promote  earlier  interventions  for  prevention. 
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As  noted  previously,  we  have  revised  Aim  3  to  develop  a  lung  cancer  risk  model  to  help  predict 
the  likelihood  of  cancer  recurrence  and  second  primary  tumor  (SPT)  development  utilizing 
clinical,  pathologic  and  biomarker  information  obtained  prospectively  and  retrospectively  from 
the  high-risk  population  of  patients  (Years  4-5). 

Summary  of  Research  Findings 

Dr.  Ignacio  Wistuba,  Pathology  Core  Director,  has  identified  archival  tissue  specimens  in  our 
tissue  bank  from  over  600  surgically  resected  lung  cancers,  stages  I/ll,  acquired  over  the  period 
of  2002-2005.  These  specimens  are  available  for  our  use  and  meet  the  criteria  in  our  proposed 
biomarker  studies.  These  retrospective  specimens  all  have  follow-up  data  for  a  minimum  of  2 
years.  All  NSCLC  case  specimens  with  adjacent  bronchial  structures  will  be  included  in  the 
study.  Complete  clinical  and  pathologic  characteristics  will  need  to  be  audited  and  incorporated 
into  our  shared  VITAL/PROSPECT  database.  We  will  thus  increase  our  samples  size  from  300 
to  500  for  use  in  developing  our  risk  model  by  supplementing  the  available  prospective 
specimens  with  additional  retrospective  specimens.  Our  focus  in  VITAL  is  on  lung  epithelial 
changes,  rather  than  changes  in  the  tumor  itself;  hence,  by  including  all  patients  with  specimens 
from  adjacent  bronchial  epithelium,  the  goal  of  developing  a  risk  model  can  be  achieved.  (See 
Core  C  for  more  detail.) 

We  have  completed  entry  into  the  pathology  database  for  the  500  archival  tissue  specimens  of 
resected  lung  cancers.  The  clinical  database  has  been  developed  and  data  entry  is  in  the 
process  of  being  completed  via  both  patient  record  review  and  contacting  the  patient  or  family 
members  directly.  As  permission  must  be  granted  first  before  person-to-person  contact  can  be 
made,  more  than  300  IRB-approved  letters  to  patients  have  been  mailed  requesting  permission 
to  contact  them  regarding  this  study.  Once  we  receive  patient  permission,  we  will  contact 
individual  patients  directly  for  a  long-term  update  on  their  condition.  Patients  who  continue  to  be 
followed  at  MDACC  and  have  adequate  records  will  not  need  to  be  contacted  through  this 
mechanism. 

Key  Research  Accomplishments 

•  Enrolled  9  more  patients  in  the  Vanguard  study  and  have  2  patients  pending,  for  a  total 
of  55. 

•  Continued  to  collect  patient  clinical  data  and  tissues  for  distribution  to  support  research 
projects  in  the  VITAL  grant. 

•  31  patients  have  completed  the  12-month  mark/bronchoscopy,  with  a  projected  total  of 
at  least  40  evaluable  patients  by  the  end  of  January  2009. 

•  ReVITALization  plan  is  being  implemented  with  data  entry  into  the  pathology  database 
related  to  the  500  archived  tissue  specimens,  and  updating  of  the  clinical  database  for 
these  patients. 

Meeting  Abstracts 

We  plan  to  report  clinical  findings  on  the  Vanguard  Trial  at  a  major  meeting  in  2010. 

Conclusion 

The  completion  of  the  Vanguard  trial  is  important  to  this  grant.  However,  the  ultimate  goal  is  the 
development  of  a  risk  model  for  development  of  SPT  and  recurrence.  This  model  will  be 
developed  utilizing  the  resources  from  both  the  Vanguard  trial  and  our  Lung  Cancer  Tissue 
Bank  as  a  top  priority  for  our  program. 
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Project  2:  Identification  of  Biomarkers  of  Response  to  Chemoprevention  Agents  in 
Lung  Epithelium 

(PI  and  co-PIs:  Li  Mao,  M.D.,  Reuben  Lotan,  Ph.D.,  John  Minna,  M.D.) 

Lung  cancer  continues  to  be  the  deadliest  among  all  cancers  in  the  United  States  with  over 
165,000  deaths  annually  for  the  last  few  years  and  an  overall  5-year  survival  rate  of  less  than 
15%  (Jemal  et  al.,  2007).  Early  detection  of  premalignant  lesions  or  tumors  appears  to  be  an 
efficient  approach  to  reducing  the  morbidity  and  mortality  from  lung  cancer  because  the  survival 
of  early  stage  lung  cancer  patients  is  much  better  than  that  of  patients  with  advanced  cancers. 
Therefore,  new  strategies  for  the  early  diagnosis,  prevention  and  treatment  of  this  dreadful 
disease  are  urgently  needed  (Wistuba  and  Gazdar  2006;  Sato  et  al.,  2007).  The  development  of 
early  detection  tools  for  lung  cancer  requires  improved  molecular  testing  by  identification  and 
understanding  of  early  events  in  the  multi-step  process  of  lung  carcinogenesis,  which  involves 
the  accumulation  of  genetic  and  epigenetic  alterations  over  the  long  course  of  exposure  to 
carcinogens  such  as  tobacco  smoke  (Mao,  2002;  Wistuba  and  Gazdar  2006;  Sato  et  al.,  2007). 
To  date,  there  are  no  validated  biomarkers  for  early  detection.  Moreover,  one  or  a  few  genes 
may  not  provide  sufficient  specificity  given  the  multi-factorial  process  of  lung  carcinogenesis  and 
heterogeneous  nature  of  lung  cancer.  Thus,  the  effort  to  search  for  more  specific  and  sensitive 
biomarkers  of  early  lung  cancer  is  warranted.  The  development  of  high-throughput  gene 
expression  analyses,  e.g.,  DNA-chips  or  microarrays,  provides  opportunities  to  define 
biomarkers  (signatures)  of  risk  of  cancer  development.  During  the  last  few  years,  several 
studies  reported  molecular  classification  of  human  lung  carcinomas  on  the  basis  of  gene 
expression  and  described  numerous  putative  biological  markers  of  cancer  (Meyerson  et  al., 
2004).  However,  only  limited  number  of  studies  has  attempted  to  identify  genes  that  are 
modulated  at  early  stages  of  human  lung  carcinogenesis  such  as  premalignant  state  because  of 
the  limited  availability  of  premalignant  lung  tissues  suitable  for  RNA  extraction.  We 
hypothesized  that  immortalized,  transformed  and  tumorigenic  human  bronchial  epithelial  cell 
(HBEC)  line  models  will  have  similar  abnormalities  in  gene  expression  profiles  as  premalignant 
and  malignant  tissues  in  vivo.  Therefore,  such  cell  models  will  be  useful  to  identify  markers  of 
early  disease. 

We  proposed  to  use  genomic  and  proteomic  analyses  to  identify  changes  in  gene  expression 
(including  mRNA  and  miRNAs)  and  proteins  which  correlate/associate  with  cancer  risk  in  the 
carcinogen  damaged  aerodigestive  tract  field  and  also  use  these  signatures  to  monitor  the 
response  of  this  field  to  chemoprevention.  We  will  develop  and  use  a  model  HBEC  system  to 
study  the  effect  of  specific  oncogenic  changes  and  also  the  response  of  these  manipulated 
HBECs  to  various  carcinogenic  and  chemoprevention  agents.  Thus,  we  will  determine 
modifications  of  these  changes  by  chemopreventive  agents  in  premalignant  cells  in  vitro  and  to 
use  probes  for  the  modified  genes  and  proteins  to  analyze  tissue  specimens  from  individuals 
participating  in  the  chemoprevention  clinical  trials. 

Aim  1  Develop  immortalized  human  bronchial  epithelial  cell  cultures  using  a  subset  of 
patient  tissue  specimens  collected  in  Project  1  and  characterize  the  expression 
profiles  of  these  cells  using  oligonucleotide  based  microarrays. 

The  main  goal  of  this  aim  of  this  project  is  to  establish  these  cultures  from  lung  cancer  patients 
and  persons  without  lung  cancer,  including  those  patients  entered  onto  the  clinical  trial 
described  in  Project  1 ,  and  to  characterize  their  gene  expression  profiles. 
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Summary  of  Research  Findings 

A.  Generating  immortalized  HBECs  and  Small  Airway  Epithelial  Cells  (HSAECs)  from 
different  individuals  that  were  collected  on  various  tissue  procurement  protocols  at  The 
University  of  Texas  Southwestern  and  at  M.  D.  Anderson  Cancer  Center. 

To  date,  we  have  generated  HBECS  from  over  45  different  individuals  and  have  recently 
immortalized  10  peripheral  small  airway  epithelial  cells  (HSAECs)  (Fig.  1). 

These  specimens  represent  males,  females,  a  spectrum  of  smoking  status,  as  well  as  persons 
with  and  without  lung  cancer.  We  have  introduced  oncogenic  exposures  into  several  of  the 
HBECS  and  HSAECs,  such  as  oncogenic  KRASV12  and  EGFR  with  mutations,  that  are  found 
in  patients.  We  have  also  immortalized  15  of  these  same  bronchial  epithelial  specimens  with 
oncogenic  HPV  E6  and  E7,  and  made  several  pairs  of  immortalized  HBECs  and  lung  cancer 
cell  lines  from  the  same  patient. 


SAGM  RPMI  2%  Serum 


Figure  1.  Immortalized  normal  human  small  airway  epithelial  cells  (HSAEC)  growing  in  matrigel  under  various 
growth  conditions.  Human  airway  epithelial  cells  from  the  peripheral  lung  were  immortalized  with  hTERT  and  CDK4  and 
grown  in  defined  small  airway  growth  media  (SAGM)  in  monolayer  culture  (top  panel).  They  were  then  plated  in  matrigel  and 
tested  for  growth  and  differentiation  in  SAGM  or  in  RPMI1640  medium  with  2%  fetal  calf  serum  with  or  without  a  supplement 
to  induce  differentiation  (KIAD:  K,  KGF/FGF-7;  I,  3-isobutyl-1-methylxanthine  (IBMX);  A,  8-bomoadenosine  3’,5’-cyclic 
monophosphate;  D,  dexamethasone).  In  matrigel,  the  HSAECs  undergo  branching  morphogenesis  and  KIAD  aids  to  the 
differentiation,  which  is  partially  reversed  by  serum. 
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B.  Developed  new  ways  and  media  to  have 
the  airway  epithelial  cells  differentiate  and 
study  their  stem  cell  like  properties.  We 

have  used  the  defined  small  airway  growth 
medium  (SAGM)  combined  with  Matrigel 
culture  to  study  the  ability  of  HSAECs  to 
differentiate  in  vitro.  In  matrigel,  they  form 
cyst-like  structures  and  show  increased 
expression  of  CC10  and  SP-C.  This  approach 
has  also  involved  adding  differentiation  factors 
(KIAD,  see  legend  of  Fig.  1 )  to  the  SAGM  or  to 
serum  containing  media.  The  addition  of 
serum  also  promotes  differentiation  with 
increased  expression  of  Type  2  cell  marker 
SP-C  and  Type  1  cell  marker  Aquaporin  5. 
The  HSAECs  do  not  express  Type  2  cell 
markers  SP-A,  SP-C  by  qRT-  PCR,  but  do 
express  low  levels  of  Clara  cell  marker  CC10, 
and  high  levels  of  Notch  stem  cell  markers. 
The  HSAECs  have  a  high  percentage  of  cells 
that  are  aldefluor-positive  as  well  as  express 
high  levels  of  Notch  stem  cell  genes  (both 
markers  of  stem  cells). 

C.  Induced  oncogenic  changes  in  the 
HSAECs  and  performed  genomic  analyses. 

We  have  introduced  various  oncogenic 
changes  into  the  HSAECs  beginning  with  the 
introduction  of  oncogenic  KRASV12,  which 
causes  the  cells  to  lose  some  of  their 
differentiating  ability.  These  “preneoplastic” 
HSAECs  are  being  tested  with 
chemoprevention  agents  as  well  as  having 
microarray  mRNA  expression  profiles  and 
array-based  CGH  copy  number  profiles 
performed. 

D.  Identifying  a  large  number  of  new 
tyrosine  phosphorylation  targets  of 
oncogenes  including  KRASV12  and  mutant 
EGFR  in  HBECs  using  unbiased 
phosphoproteomic  approaches  combined 
with  quantitative  analyses  with  mass 
spectroscopy.  In  collaboration  with  the 
laboratory  of  Dr.  Harold  Varmus  at  Memorial 
Sloan  Kettering  Cancer  Center  (MSKCC),  we 
studied  the  tyrosine  phosphorylation  changes 

induced  in  HBECs  by  the  presence  of  mutant  oncogenes.  We  have  used  unbiased 
phosphoproteomic  approaches,  based  on  quantitative  mass  spectrometry  using  stable  isotope 
labeling  with  amino  acids  in  cell  culture  (SILAC),  to  identify  tyrosine  phosphorylated  proteins  in 
isogenic  human  bronchial  epithelial  cells  (HBECs)  and  human  lung  adenocarcinoma  cell  lines, 
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Figure  2.  Increased  tyrosine  phosphorylation 
of  proteins  in  HBECs  or  adenocarcinoma  cells 
expressing  mutant  EGFR  compared  with  cells 
expressing  mutant  KRAS.  Increased  tyrosine 
phosphorylation  of  proteins  in  HBECs  or 
adenocarcinoma  cells  expressing  mutant  EGFR 
compared  with  cells  expressing  mutant  KRAS. 
Schematic  of  experimental  design  and 
representative  MS  spectra  of  a  peptide  of  ERBB2 
(A)  and  a  peptide  of  EGFR  (C)  identified  in  the 
phosphotyrosine  immunoprecipitates  of  proteins 
from  lysates  of  HBECs  and  adenocarcinoma  cell 
lines,  respectively.  (B)  The  degree  of  tyrosine 
phosphorylation  of  proteins  in  mutant  RAS 
expressing  cells  does  not  correlate  with  that  in 
mutant  EGFR-expressing  cells  as  demonstrated 
in  the  log-log  plot  of  the  two  ratios  obtained  in  this 
experiment  (Del  EGFR/WT  EGFR  and  Mut 
RAS/WT  EGFR).  Log  10  transformation  of  SILAC 
ratios  for  individual  proteins  and  linear  regression 
analysis  yields  a  coefficient  of  determination  (r2) 
of  0.20,  and  a  Pearson  product-moment 
correlation  (r)  of  0.45.  (Guha  U.,  et  al.,  PNAS 
2008;105:14112-14117). 
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expressing  either  of  the  two  mutant  alleles  of  EGFR  ( L858R  and  Del  E746-A750 ),  or  a  mutant 
KRAS  allele,  which  are  common  in  human  lung  adenocarcinomas  (Fig.  2).  The  mass  spec 
results  were  validated  by  standard  Western  blotting  experiments  (Fig.  3). 
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Figure  3.  Validation  of  mass 
spectrometry-based 
quantitation  by 
immunoprecipitation  and 
Western  blots  of  representative 
proteins.  Validation  of  mass 
spectrometry-based  quantitation 
by  immunoprecipitation  and 
Western  blots  of  representative 
proteins.  Immunoprecipitation  was 
done  with  antibodies  to  the 
indicated  proteins  from  lysates  of 
HBECs,  and  Western  blots  done 
with  anti-phosphotyrosine  and 
protein-specific  antibodies.  (Guha 
U.,  etal.,  PNAS  2008;105:141 12- 
14117). 


Tyrosine  phosphorylation  of  signaling  molecules  was  greater  in  HBECs  expressing  the  mutant 
EGFRs  than  in  cells  expressing  wild  type  (WT)  EGFR  or  mutant  KRAS.  Receptor  tyrosine 
kinases  (such  as  EGFR,  ERBB2,  MET,  and  IGF1R),  and  Mig-6,  an  inhibitor  of  EGFR  signaling, 
were  more  phosphorylated  in  HBECs  expressing  mutant  EGFR  than  in  cells  expressing  WT 
EGFR  or  mutant  RAS.  Phosphorylation  of  some  proteins  differed  in  the  two  EGFR  mutant¬ 
expressing  cells;  for  example,  some  cell-junction  proteins  (p-catenin,  plakoglobin,  and  E- 
cadherin)  were  more  phosphorylated  in  HBECs  expressing  L858R  EGFR  than  in  cells 
expressing  Del  EGFR.  There  were  also  differences  in  degree  of  phosphorylation  at  individual 
tyrosine  sites  within  a  protein;  for  example,  a  previously  uncharacterized  phosphorylation  site  in 
the  nucleotide-binding  loop  of  the  kinase  domains  of  EGFR  (Y727),  ERBB2  (Y735),  or  ERBB4 
(Y733),  is  significantly  more  phosphorylated  in  HBECs  expressing  the  deletion  mutant  than  in 
cells  expressing  the  wild  type  or  L858R  EGFR.  Signaling  molecules  not  previously  implicated  in 
ERBB  signaling,  such  as  polymerase  transcript  release  factor  (PTRF),  were  also 
phosphorylated  in  cells  expressing  mutant  EGFR.  Bayesian  network  analysis  of  these  and 
other  datasets  revealed  that  PTRF  might  be  a  potentially  important  component  of  the  ERBB 
signaling  network  (Fig.  4).  This  proteomic  information  provides  important  new  biomarkers  and 
potential  targets  for  chemoprevention  of  these  specific  oncogenic  changes  in  human  airway 
epithelial  cells,  and  provides  a  new  model  to  identify  similar  markers  after  other  oncogenic 
changes. 
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Aim  2  Characterize  effects  of  the  chemo  preventive  agents  used  in  Project  1  on  cell 
proliferation  and  apoptosis  in  the  immortalized  human  bronchial  epithelial  cell 
cultures  developed  in  Specific  Aim  1. 

We  will  determine  the  potential  role  of  different  chemo  preventive  agents  [e.g.,  celecoxib,  N-[4- 
hydroxyphenyl]retinamide  (4-HPR),  Iressa  (gefitinib),  and  SCH63663]  alone  or  in  combination 
with  one  another  for  their  effects  on  cell  proliferation  and  apoptosis  in  cell  cultures  established  in 
Aim  1 .  We  will  also  determine  the  relative  sensitivity  among  the  various  cell  cultures  to  each  of 
the  agents  by  determining  the  50%  growth  inhibitory  concentration  (IC50). 

Summary  of  Research  Findings 

Results  from  this  completed  Specific  Aim  were  presented  previously. 

Aim  3  Identify  gene  expression  and  protein  “signatures”  which  reflect  lung 
tumorigenesis  and  sensitivity  or  resistance  to  chemo  preventive  regimens 
proposed  in  Project  1,  and  to  validate  the  signatures  and  to  determine  their 
biological  importance  in  precancer  cell  models  of  lung  cancer. 

Summary  of  Research  Findings 

A.  Perform  high-density  array  CGH  analysis  of  lung  cancers  to  identify  regions 
commonly  amplified  in  lung  cancer.  In  collaboration  with  the  laboratories  of  Wan  Lam  at  the 
Vancouver  British  Columbia  Cancer  Agency  (BCCA)  and  Jon  Pollack  at  Stanford  University 
Medical  Center,  we  have  been  studying  DNA  copy  number  changes  in  lung  cancers  and 
preneoplastic  tissues,  identifying  the  key  genes  within  these  amplicons  and  then  performing 
functional  studies  to  know  which  are  the  most  important  in  the  pathogenesis  of  lung  cancer. 
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Chromosomal  translocation  is  the  best-characterized  genetic  mechanism  for  oncogene 
activation.  However,  there  are  documented  examples  of  activation  by  alternate  mechanisms,  for 
example  gene  dosage  increase,  though  its  prevalence  is  unclear.  Here,  we  answered  the 
fundamental  question  of  the  contribution  of  DNA  amplification  as  a  molecular  mechanism  driving 
oncogenesis.  Comparing  104  cancer  lines  representing  diverse  tissue  origins  identified  genes 
residing  in  amplification  'hotspots,'  we  discovered  an  unexpected  frequency  of  genes  activated 
by  this  mechanism  (Fig.  5).  The  amplification  regions  were  correlated  with  microarray  data  and 
the  overexpressed  genes  in  each  of  the  regions  were  identified  (Fig.  6).  The  3,431  amplicons 
identified  represent  approximately  10  per  hematological  and  approximately  36  per  epithelial 
cancer  genome.  Many  recurrently  amplified  oncogenes  were  previously  known  to  be  activated 
only  by  disease-specific  translocations.  The  135  hotspots  identified  contain  538  unique  genes 
and  are  enriched  for  proliferation,  apoptosis  and  linage-dependency  genes,  reflecting  functions 
advantageous  to  tumor  growth.  Integrating  gene  dosage  with  expression  data  validated  the 
downstream  impact  of  the  novel  amplification  events  in  both  cell  lines  and  clinical  samples.  For 


Figure  4.  Bayesian  network  modeling  of 
phosphorylation  data  from  the  current 
study  and  eight  other  published  ERBB 
signaling  related  tyrosine  phosphorylation 
data.  Bayesian  network  modeling  of 
phosphorylation  data  from  the  current  study 
and  eight  other  published  ERBB  signaling 
related  tyrosine  phosphorylation  data.  (A)  The 
heatmap  of  the  18  proteins  with  the 
approximations  for  the  missing  data 
generated  by  the  “nearest  neighbor  method” 
using  a  discrete  distance  measure.  (B)  The 
top  scoring  Bayesian  network  generated  from 
the  above  phosphorylation  data.  The  nodes 
are  obtained  from  the  phosphorylation  ratios 
from  the  datasets.  The  edges  represent  direct 
and  indirect  causal  influence.  The  arrows 
indicate  the  direction  of  causality.  Nodes 
connected  by  black  edges  have  the  same 
phosphorylation  level  more  often  than  not, 
indicating  a  positive  influence.  EGFR  and 
ERBB2,  were  forced  source  nodes  in  the 
network;  they  were  only  allowed  outgoing 
edges  except  between  each  other.  Nodes 
were  restricted  to  have  no  more  than  three 
parents.  Yellow  nodes  are  kinases  and  blue 
nodes  phosphatases.  Note  central  role  of 
PTRF  (Guha  U.,  etal.,  PNAS 
2008:105:141 12-141 171. 
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example,  multiple  downstream  components  of  the  EGFR  family  signaling  pathway,  including 
CDK5,  AKT1  and  SHC1,  are  overexpressed  as  a  direct  result  of  gene  amplification  in  lung 
cancer  (Fig.  7).  Our  findings  suggest  that  amplification  is  far  more  common  a  mechanism  of 
oncogene  activation  than  previously  believed  and  that  specific  regions  of  the  genome  are 
hotspots  of  amplification.  These  amplicons  provide  important  new  biomarkers  that  may  be 
present  in  preneoplastic  lesions  (see  example  below  in  Fig.  10)  and  could  be  promising  targets 
for  the  monitoring  and  chemoprevention  of  lung  cancer. 
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B.  Study  of  the  expression  of  all  48  nuclear  hormone  receptors  (NRs)  reveals  tumor 
specific  differences  in  microdissected  tumor  and  normal  lung  epithelium.  In  collaboration 
with  Dr.  Wistuba,  the  Minna  and  Mangelsdorf  laboratories  at  UTSW  studied  the  expression  of  all 
48  NRs  using  qRT  PCR  technology  on  RNA  isolated  from  microdissected  tumors  and  normal 
lung  epithelium  from  30  patients  (Fig.  8).  This  approach  revealed  NRs  whose  expression 
differed  between  tumor  and  normal  lung  from  the  same  patient,  and  also  showed  great  variation 
in  the  expression  of  NRs  between  different  lung  cancers.  Our  ultimate  goal  would  be  to  use 
these  NRs  expression  profiles  to  hormonally  manipulate  lung  cancer  or  as  chemoprevention 
targets.  We  are  interested  if  the  pattern  of  NR  expression  in  tumor  may  yield  prognostic 
information  on  patient  survival  (Fig.  9).  This  approach  was  validated  on  an  entirely  separate 
lung  cancer  consortium  data  set  of  442  lung  adenocarcinomas,  and  demonstrates  the  biologic 
relevance  of  NR  expression  patterns  in  lung  cancer. 
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Figure  5.  Hotspots  of  amplification  present  in  cancer  genomes.  A  histogram  summarizing  the  regions 
of  amplification  detected  on  SMRT  array  v.2  (BAC  array)  across  all  104  cancer  cell  line  samples  with  the 
resulting  values  scaled  to  the  segment  with  the  highest  count  (  shown  as  28)  and  plotted  against  their 
corresponding  genomic  position.  Hotspots  are  denoted  by  the  dark  blue  shading,  whereas  the  light  blue 
shading  represents  regions  amplified  ~5  times.  Triangles  mark  common  fragile  sites. 

(Lockwood,  Oncogene  27:4615,  2008). 
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Figure  6.  Impact  of  amplification  on  gene 
transcription  levels.  The  relative  expression  values  for 
genes  with  amplification  and  those  with  neutral  copy 
number  status  are  plotted  as  heatmaps  for  over¬ 
expressed  genes  from  representative  hotspots.  The 
expression  values  for  each  gene  have  been  normalized 
and  scaled  across  the  samples  from  0  to  100.  Red 
intensity  indicates  expression  level.  (Data  from  N  =  27 
NSCLC  cell  line  samples  and  3  HBEC  samples  using 
U133A  and  U133B  Affymetrix  array  data  with  23,583 
unique  genes).  (From  Lockwood,  et  al., Oncogene 
27:4615,2008). 
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Figure  7.  Frequent  amplification  and 
overexpression  of  multiple  EGFR-family- 
signaling  components  in  non-small  cell  lung 
cancer  (NSCLC).  Diagram  highlights  the 
interaction  of  EGFR,  SHC1,  CDK5,  SHC1  and 
MYC  in  the  EGFR  family-signaling  pathway. 
Altered  components  are  shaded  grey  and  the 
effect  on  various  cell  pathways  indicated  (From 
Lockwood,  et  al.,  Oncogene  27:4615,  2008). 
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Figure  8.  Quantitative  mRNA  expression  of  all  48  nuclear  hormone  receptors  (NRs)  in  pair  matched 
tumor  and  lung  epithelium  microdissected  from  the  same  patients  (N  =  30).  Note  differences  in  expression 
between  tumor  and  normal  epithelium  in  many  patients  for  several  receptors  and  the  great  variation  in  expression 
of  NRs  between  different  tumor  samples.  X  axis,  each  of  the  30  samples;  Y  axis,  the  level  of  mRNA  expression 
for  each  receptor  determined  by  Q  RT  PCR.  White  bars,  expression  of  NRs  in  histologically  normal  lung 
epithelium;  black  bars,  expression  of  the  NR  in  the  matched  tumor  sample. 


Aim  4  Develop  techniques  to  assess  these  molecular  signatures  in  tissue  specimens 
and  serum  obtained  in  Project  1,  and  assess  the  relevance  of  these  molecular  signatures 
as  in  vivo  biomarkers  using  baseline  and  post-treatment  specimens. 

Summary  of  Research  Findings 

A.  Identify  regions  amplified  in  preneoplastic  lung  lesions.  The  preferential  selection  of 
gene  amplification  events  would  drive  tumorigenesis  in  specific  cell  lineages  giving  rise  to  lung 
cancer  subtypes.  As  part  of  the  aCGH  studies  in  collaboration  with  Dr.  Wan  Lam,  we 
discovered  a  new  cell  lineage  specific  genetic  event  that  may  provide  novel  target  for  new 
treatment  strategies.  Through  integrative  genetic  analyses  of  multiple  independent  cohorts  of 
clinical  tumor  samples,  we  identify  the  overexpression  of  BRF2  (chromosome  region  8p),  a  RNA 
polymerase  III  (Pol  III)  transcription  initiation  factor,  as  the  result  of  increased  gene  dosage  in 
the  squamous  cell  lineage  leading  to  lung  squamous  cell  carcinoma  (SqCC).  We  found  that 
BRF2  was  amplified  and  overexpressed  in  preneoplastic  carcinoma  in  situ  (CIS)  lesions  in  the 
airway  epithelium  of  lung  cancer  patients  (Fig.  10). 
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MDA  LOOCV  u  Consortium  training  and  testing 


Figure  9.  Expression  of  NRs  in  tumor 
provides  prognostic  information  for  the 
survival  of  non  small  cell  lung  cancer 
patients  that  is  validated  on  a  lung  cancer 
consortium  dataset.  Prediction  model  of  the 
NR  signature  is  cross-validated  by  leave-one-out 
method,  independently  validated  in  multi- 
institutional  consortium  dataset,  and  cross- 
validated  by  different  platforms  of  genetic 
analysis.  (Multi-institutional  data  on  N  =  442 
lung  adenocarcinomas  is  from  Shedden  K,  et  al. 
Gene  expression-based  survival  prediction  in 
lung  adenocarcinoma:  a  multi-site,  blinded 
validation  study.  Nat  Med  2008.)  (LOOCV, 
leave-one-out  cross  validation).  The  MD 
Anderson  Cancer  Center  (MDA)  30-patient  data 
set  was  (a)  first  validated  by  LOOCV  and  then 
tested  on  the  Consortium  NR  dataset  derived 
from  microarray  expression  profile  data  (panel 
C).  Similarly,  the  Consortium  data  set  was 
divided  into  training  and  testing  components  for 
NR  expression  (Panel  B)  and  validated.  Finally, 
the  Consortium  NR  expression  signature  was 
validated  on  the  MDACC  data  set  (Panel  D). 


a  N  QS  i  b  Am^jfotion  c  CIST  CISZ 


Figure  10.  Amplification  and  overexpression  of  BRF2 
in  preinvasive  SqCC  lesions,  (a)  Frequent  copy  number 
increase  of  chromosome  arm  8p  in  20  bronchial  CIS 
lesions.  Samples  are  ordered  in  columns  and  ordered  by 
genomic  position  along  8p.  The  color  scale  ranges  from 
white  (neutral  copy  number,  N)  to  black  (amplification, 
Amp).  Data  from  representative  normal  lung  (N)  and 
SqCC  tumor  samples  (T)  are  displayed  to  the  left  and  right 
of  the  CIS  cases  respectively,  (b)  Amplification  score 
along  chromosome  8p  for  the  20  CIS  cases.  Regions  of 
amplification  were  defined  for  each  case  and  summarized 
across  the  group  to  determine  the  incidence  of 
occurrence.  Dashed  lines  represent  the  positions  of  BRF2, 
WHSC1LC  and  FGFR1  from  top  to  bottom  respectively. 

(c)  Array  CGH  copy  number  profiles  for  two  individual  CIS 
cases  with  8p  amplification.  Each  black  dot  represents  an 
array  element  ordered  by  genomic  position.  Those  shifted 
to  the  left  of  the  middle  line  (N)  have  decreased  copy 
number  (Del)  whereas  those  shifted  to  the  right  have 
increased  copy  number  (Amp).  Dashed  lines  represent  the 
positions  of  the  three  genes  as  in  c.  The  region  highlighted 
in  orange  represents  the  region  of  high  level  amplification 
in  each  sample.  The  amplicon  in  CIS1  includes  only  BRF2 
with  WHSC1L1  and  FGFR1  outside  or  spanning  the 
boundaries  while  the  amplicon  in  CIS2  contains  all  three 
genes,  (d)  Immunostaining  of  CIS2  with  anti-BRF2 
polyclonal  antibody  revealed  elevated  staining  in  CIS 
epithelia  compared  with  normal  from  the  same  tissue 
section.  (From  Lockwood  et  al.  BRF2  is  a  Lineage  Specific 
Oncogene  Amplified  Early  in  Squamous  Cell  Lung  Cancer 
Development.  Proc  Natl  Acad  Sci  U  S  A  2008;  (Submitted, 
PNAS  2008-11342.) 
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Frequent  activation  of  BRF2  in  pre-invasive  bronchial  carcinoma  in  situ  and  dysplastic  lesions 
provides  evidence  that  BRF2  expression  is  an  early  event  in  cancer  development  of  this  cell 
lineage.  Ectopic  expression  of  BRF2  in  human  bronchial  epithelial  cells  induces  a  transformed 
phenotype  and  demonstrates  downstream  oncogenic  effects,  while  siRNA-mediated  knockdown 
suppresses  growth  of  cells  overexpressing  BRF2  (data  not  shown).  Our  data  suggests  that 
genetic  alteration  of  BRF2  represents  a  novel  mechanism  of  lung  tumorigenesis  through  the 
increase  of  Pol  Ill-mediated  transcription  in  cancer. 

Aim  5.  Identify  gene  expression  signatures  that  characterize  progression  from 
immortalized  to  transformed  to  tumorigenic  human  bronchial  epithelial  cells  based  on 
already  available  high-throughput  gene  expression  microarray  data  and  validate  these 
signatures  using  tissue  microarrays  (TMAs)  containing  normal  bronchial  epithelium, 
hyperplasia,  squamous  metaplasia,  dysplasias,  squamous  cell  carcinomas,  atypical 
adenomatous  hyperplasia,  and  adenocarcinomas  (Years  4-5). 

Summary  of  Research  Findings 

We  used  the  in  vitro  human  lung  carcinogenesis  model  described  in  the  previous  year’s  report. 
This  model  includes  normal  human  bronchial  epithelial  (NHBE)  cells,  BEAS-2B  cells,  which  are 
NHBE  immortalized  with  SV40  T /  Adeno12  virus  and  transformed  (1198)  and  tumorigenic 
(1170-1)  cells  derived  from  BEAS-2B  after  exposure  to  cigarette  smoke  condensate  in  vivo. 
Immortalized  HBECs  (1799),  derived  from  BEAS-2B  without  exposure  to  cigarette  smoke 
condensate,  were  also  used  in  this  model.  These  isogenic  cells  represent  normal,  immortalized, 
transformed  and  tumorigenic  cells  and  their  study  offers  an  opportunity  to  identify  different 
progressive  changes  in  genotype  or  epigenetic  changes. 

During  last  year,  we  further  examined  the  functional  significance  and  clinical  relevance  of  the 
differentially  expressed  genes.  We  used  the  Ingenuity  Pathways  Analysis®  software  to  gain  a 
pathway  level  of  understanding  of  genes  differentially  expressed  between  the  normal, 
immortalized,  transformed,  and  tumorigenic  cells  and  how  they  fit  within  molecular  interaction 
networks.  We  found  that  many  of  the  differentially  expressed  genes  (e.g.,  UBE2C,  MCM2, 
FEN1,  and  BIRC5)  function  in  cell  cycle  control  and  display  significant  interactions  with  each 
other  and  with  other  known  cancer-related  genes  in  significant  topological  gene  networks 
(Fig.  11). 
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IBM 


Figure  11.  Network  of  genes  related  to  cell  cycle  progression  significantly  modulated  in  lung  carcinogenesis 
model  including  normal  (NHBE),  B)  immortalized  (1799);  C)  transformed  (1198);  and  D)  tumorigenic  (1170-1) 
cells.  The  changes  in  gene  expression  relative  to  normal  NHBE  (A,  grey)  are  colored  as  follows:  Red, 
upregulated;  green,  downregulated;  grey,  unchanged. 


We  validated  several  of  the  more  differentially  expressed  genes  by  quantitative  PCR  and 
western  blotting  and  found  the  expression  of  MCM2  (minichromosome  maintenance  2),  TPX2 
[microtubule-associated,  homolog  (Xenopus  laevis)],  FEN  (Flap  endonuclease  1),  and  UBE2C 
(Ubiquitin  conjugating  enzyme  E2C)  increased  from  normal  to  tumorigenic  states,  whereas  the 
expression  of  SFN  (stratifin  or  14-3-3-sigma)  decreased  (Fig  12).  The  expression  of  these 
genes  in  cell  lines  derived  from  human  lung  tumors  was  similar  to  the  1170  tumorigenic  cells  in 
our  cell  model. 
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Figure  12.  Validation  of  the  differential  expression  of  MCM2,  SFN,  TPX2,  FEN,  and  UBE2C  genes 
in  cells  comprising  the  in  vitro  carcinogenesis  model  (A)  by  quantitative  PCR.  Western  blotting 
analysis  of  these  selected  genes  in  the  in  vitro  carcinogenesis  model  (B)  and  several  NSCLC  cell 
lines  from  human  tumors  (C). 


To  determine  the  lung  cancer  relevance  of  the  data  obtained  using  the  in  vitro  lung 
carcinogenesis  system,  we  used  in  silico  analysis  of  data  published  by  Garber  et  al.  and  found 
that  UBE2C  expression  was  higher  in  lung  tumors,  both  adenocarcinomas  and  squamous  cell 
carcinomas,  relative  to  normal  lung  tissue  (Fig.  13). 


adenocarcinoma 


UBE2C  mRNA 
Garber  cohort 


see 


Lung 


Adenocarcinoma 
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Figure  13. 
Validation  of  the 
differential 
expression  of 
the  UBE2C 
gene  by  in  silico 
analysis  of 
publicly 

available  data  of 
gene 

expression  in 
lung  cancer  and 
normal  lung 
tissue 

specimens  from 
the  Garber 
cohort. 


Further  studies  were  focused  on  UBE2C  because  it  plays  important  roles  in  the  ubiquitination 
pathway,  which  has  been  shown  to  affect  cyclin-A  degradation  and  control  the  process  by  which 
cells  proceed  from  DNA  segregation  and  cell  division  to  a  new  round  of  DNA  replication.  UBE2C 
is  over  expressed  in  different  types  of  cancer. 
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However,  the  relationship  between  UBE2C  expression  and  lung  cancer  development  and 
prognosis  is  not  known.  In  collaboration  with  Dr.  Ignacio  Wistuba,  we  found  using 
immunohistochemical  methods  that  UBE2C  protein  level  is  undetectable  in  normal  appearing 
lung  epithelium,  but  its  level  increases  in  aberrantly  proliferating  lesions  and  premalignant 
lesions  with  a  further  increase  with  the  severity  of  the  progression  to  malignancy  along  the  SCC 
development  pathway  (Fig.  14).  In  addition,  there  was  an  increased  expression  in 
adenocarcinoma  compared  to  normal  epithelium  (Fig.  15). 


UBE2C  immunohistochemical  staining 


Figure  14. 

Immunohistochemical  staining 
of  UBE2C  in  histological 
sections  of  lung  tissues 
including  normal  epithelium, 
hyperplasia,  squamous 
metaplasia,  mild  dysplasia, 
severe  dysplasia,  and 
squamous  cell  carcinoma 
(SCC). 
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Overall  survival  Progression-free  survival 


Figure  16.  Kaplan-Meier  plots  and  log-rank  statistics  for  the  overall  survival  (A)  and 
progression-free  survival  (B)  of  stage  I  NSCLC  patients  separated  according  to 
UBE2C  exDression. 


Finally,  we  examined 
whether  the 

expression  of  UBE2C 
is  related  to 
prognosis  of  lung 
cancer  patients  with 
stage  I  NSCLC. 
Figure  16  shows  that 
both  overall  survival 
and  progression-free 
survival  were 

statistically 
significantly  lower  in 
those  cases  where 
UBE2C  expression 
was  positive. 


Aim  6.  Identify  gene  expression  signatures  in  bronchial  brush  specimens  from  the  50-60 
patients  enrolled  in  the  Vanguard  study  using  high-throughput  genomics  approach 
(Years  4-5). 

Summary  of  Research  Findings 

1 .  Samples  from  patients  enrolled  in  Vanguard  project. 

We  have  been  collecting  bronchial  brush  samples  from  Vanguard  project  (ID:  2003-0424).  As 
of  12/12/08,  we  have  collected  samples  from  45  patients  at  baseline,  29  patients  at  12  months, 
15  patients  at  24  months,  and  5  patients  at  36  months  during  follow-up  period  (total  95  time 
points).  A  summary  of  the  sample  information  is  presented  in  the  following  table. 


Site  of  bronchial  brush 

No.  of 
timepoints 

LB10* 

LUL* 

MC* 

RB10* 

RML* 

RUL* 

Baseline 

45 

44 

44 

44 

44 

44 

44 

12  months 

29 

29 

29 

29 

25 

29 

29 

24  months 

15 

15 

15 

15 

15 

15 

15 

36  months 

5 

5 

5 

5 

5 

5 

5 

Total 

95 

560 

*  LB10,  left  lower  lobe;  LUL,  left  upper  lobe;  MC,  main  carina;  RB10,  right  lower  lobe;  RML,  right  middle  lobe;  RUL, 
right  upper  lobe. 


2.  RNA  preparation 

Total  RNA  have  been  extracted  from  all  samples  from  205  samples  including  all  samples  from 
MC  sites. 
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3.  Global  gene  expression  analysis 

Among  the  205  RNA  samples,  we  have  successfully  amplified  and  hybridized  138  samples 
using  Affymetrix  ST1.0  Human  gene  chips.  The  priority  has  been  given  to  samples  at  the  MC 
site  collected  at  baseline  and  12  months  in  order  to  address  whether  gene  expression  profiles  at 
the  main  airway  can  be  affected  by  lung  cancer  risk  (comparing  these  profiles  with  those 
obtained  from  profiles  of  those  without  lung  cancer)  as  well  as  by  the  presence  of  tumors 
(comparing  the  profiles  with  profiles  obtained  from  lung  cancer  patients  before  tumors  being 
resected).  The  following  table  summarizes  samples  with  available  gene  expression  profiles. 


No.  of 
MC  sites 

No.  of  sample 
(other  site  of  diseased 
lung) 

No.  of  sample  (normal 
lung) 

Total  sites 

63 

37 

38 

138 

No.  of 

Patient 

samples 

63 

37 

We  are  now  in  the  process  to  perform  gene  expression  profiles  for  additional  samples  and  will 
analyze  the  profiles  to  address  the  questions  proposed  in  the  Revitalization  Proposal. 

Key  Research  Accomplishments 

•  Generated  HBECS  from  over  45  different  individuals  and  immortalized  10  peripheral 
small  airway  epithelial  cells  (HSAECs). 

•  Testing  “preneoplastic”  HSAECs  with  chemoprevention  agents  and  performing 
microarray  mRNA  expression  profiles  and  array-based  CGH  copy  number  profiles. 

•  Compared  104  cancer  lines  representing  diverse  tissue  origins  and  discovered  an 
unexpected  frequency  of  activated  genes  in  amplification  'hotspots.' 

•  Using  qRT  PCR  technology  on  RNA  isolated  from  microdissected  tumors  and  normal 
lung  epithelium  from  30  patients,  found  nuclear  hormone  receptors  whose  expression 
differed  between  tumor  and  normal  lung  from  the  same  patient,  and  between  different 
lung  cancers. 

•  Discovered  that  BRF2  was  amplified  and  overexpressed  in  preneoplastic  carcinoma  in 
situ  (CIS)  lesions  in  the  airway  epithelium  of  lung  cancer  patients 

•  Differentially  expressed  genes  such  as  UBE2C,  MCM2,  FEN1,  and  BIRC5  are  located 
within  gene  interaction  networks  significantly  modulated  between  normal,  immortalized, 
transformed  and  tumorigenic  cells. 

•  The  findings  suggest  that  changes  in  MCM2,  SFN,  TPX2,  FEN,  and  UBE2C  genes  are 
all  important  for  the  replication  of  immortalized,  transformed  and  tumorigenic  cells. 

•  UBE2C  protein  level  is  undetectable  in  normal  appearing  lung  epithelium,  but  its  level 
increases  in  aberrantly  proliferating  lesions  and  premalignant  lesions,  with  a  further 
increase  with  the  severity  of  the  progression  to  malignancy  along  the  SCC  development 
pathway.  In  addition,  there  was  an  increased  expression  in  adenocarcinoma  compared 
to  normal  epithelium 

•  In  lung  cancer  patients  with  stage  I  NSCLC,  both  overall  survival  and  progression-free 
survival  were  statistically  significantly  lower  in  those  cases  where  UBE2C  expression 
was  positive. 

•  Successfully  amplified  and  hybridized  138  samples  using  Affymetrix  ST1.0  Human  gene 
chips. 
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Conclusions 

•  Amplification  is  a  far  more  common  mechanism  of  oncogene  activation  than  previously 
believed  and  specific  regions  of  the  genome  are  hotspots  of  amplification.  These 
amplicons  provide  important  new  biomarkers  that  may  be  present  in  preneoplastic 
lesions  and  could  be  promising  targets  for  the  monitoring  and  chemoprevention  of  lung 
cancer. 

•  NRs  may  provide  prognostic  value  in  lung  cancer  patients. 

•  Genetic  alteration  of  BRF2  represents  a  novel  mechanism  of  lung  tumorigenesis  through 
the  increase  of  Pol  Ill-mediated  transcription  in  cancer. 

•  Differentially  expressed  genes  (e.g.,  UBE2C,  MCM2,  FEN1,  and  BIRC5)  function  in  cell 
cycle  control  and  display  significant  interactions  with  each  other  and  with  other  known 
cancer-related  genes  in  significant  topological  gene  networks. 

•  UBE2C  appears  to  be  an  promising  prognostic  biomarker  for  lung  cancer  survival. 

•  Patient  survival  and  progression-free  survival  were  statistically  significantly  lower  in 
those  cases  where  UBE2C  expression  was  positive. 


Project  3:  Premalignant  Bronchial  Epithelia:  Molecular  and  Cellular  Characterization  of 
Lung  Tumorigenesis 

(PI  and  co-PIs:  Walter  Hittelman,  Ph.D.,  Ja  Seok  Koo,  Ph.D.,  Rodolfo  C.  Morice,  M.D.) 

Aim  1  Identify  and  characterize  differentially  expressed  genes  in  the  LIFE 
bronchoscopy-identified  abnormal  areas  of  the  bronchial  epithelia  of  of  enrolled 
subjects  in  VITAL  trials. 

Previous  studies  have  shown  that  bronchial 
fluorescence  endoscopy  (LIFE) 
bronchoscopy  show  increased  genetic 
changes  when  compared  to  normal¬ 
appearing  sites,  even  if  there  are  no 
differences  in  histological  appearance. 

Since  LIFE-positive  lesions  are  at  increased 
risk  for  cancer  development,  especially 
when  they  contain  particular  genetic 
alterations,  we  hypothesize  that  these  LIFE- 
positive  sites  represent  lesions  at  an  early 
stage  of  tumorigenesis  and  may 
differentially  express  genes  important  for 
driving  tumorigenesis.  Thus,  comparative 
gene  expression  analyses  between  LIFE- 
positive  and  LIFE-negative  sites  within  the 
same  individual  may  provide  a  filter  for 
identifying  genes  whose  levels  of 
expression  are  important  for  driving 
tumorigenesis. 


regions  that  appear  abnormal  by  light-induced 


Figure  1.  The  distribution  of  gene  expression  ratio  of 
abnormal  area  vs.  normal  area  among  all  brush 
samples.  RNA  prepared  from  abnormal  area  was 
labeled  with  Cy5  and  normal  area  with  Cy3.  The  two 
sources  of  RNA  then  mixed  and  hybridized  with  Agilent 
Whole  Human  Genome  Oligo  Microarray.  The  measured 
intensity  ratio  between  the  two  fluorphores  thus 
represents  the  relative  expression  level  of  each  gene. 
Box  and  whisker  plots  were  generated  with  GeneSpring 
program. 
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Summary  of  Research  Findings 


Whole  genome  expression  profiling  microarray  analysis  produces  valuable  information  that  was 
not  previously  available,  but  is  noted  for  its  difficulty  in  obtaining  consensus  results  in  sample 
analysis.  To  minimize  the  sample  variation,  we  used  paired  bronchial  brush  samples  from  the 
normal  and  abnormal  areas  of  each  individual  to  perform  microarray  analysis.  To  date,  a  total  of 
14  sample  pairs  have  been  submitted  to  the  MDACC  microarray  core  facility  for  analysis.  The 
majority  of  sample  pairs  are  from  “white-light  abnormal”  area  vs.  “white-light  normal”  area, 
except  cases  k3  and  k4,  which  are  from  LIFE-abnormal  while  white-light  normal  area. 
Nevertheless,  there  is  no  obvious  disparity  among  these  samples  in  terms  of  their  general  gene 
expression  levels  (Fig.  1).  To  analyze  these  microarray  datasets,  we  first  examined  the 
commonly  regulated  genes  using  the  expression  level  of  all  the  sample  sets.  Sorting  the  genes 
by  their  average  expression  level  further  reveals  the  heterogeneity  of  the  samples  population 
and  revealed  the  presence  of  possible  false-  positive  results  in  the  sample  set  (Fig.  2).  To 
determine  which  sub-set  of  samples  might  be  falsely  positive  or  grossly  different  in  their  genetic 
makeup,  we  performed  an  unsupervised  classification  using  a  complete-linkage  hierarchical 
clustering  analysis.  The  analysis  distinctively  divided  the  samples  into  two  sample  clusters, 
which  we  designated  as  Cl  and  C2  (Fig.  3). 


■3.0  1:1  3.0 


43  43 

7  7  OT-NcntfLo 


Figure  2.  Heterogeneity  in  the  genome 
expression  profile  of  samples.  Only  gene 
expression  levels  with  significant  signal  ratio 
(p<0.05)  in  at  least  50%  of  the  microarray 
samples  are  included  (total  4816  probes  for 
3867  genes).  The  genes  with  expression  levels 
that  did  not  reach  significance  are  designated 
as  0,  for  no  change.  The  genes  were  sorted  in 
the  descending  order  of  average  expression 
level.  The  top  100  genes  are  shown.  Heat  map 
was  generated  with  Genesis  program. 


Fig  3.  Hierarchical  clustering  analysis  divided  all  the  brush 
samples  into  two  distinctive  sample  clusters.  The  probe  set 
meets  the  criteria  mentioned  in  Fig  2  was  used  to  perform  compete 
linkage  hierarchical  analysis.  Left:  portions  of  the  heat  map  were 
shown;  right:  top  portion  of  heat  map  was  enlarged  to  show  the 
sample  label. 


We  again  compared  these  two  clusters  by  examining  their  top  up-  and  down-regulated  genes 
separately.  However,  these  top  regulated  genes  still  show  significant  heterogeneity  within  each 
cluster  (Fig.  4)  and  they  further  suggested  that  our  measured  results  can  be  easily  affected  by 
some  outliers  with  unusual  expression  levels,  such  as  several  hemoglobin  genes.  The  presence 
of  outliers  in  the  sample  set  may  be  caused  by  the  different  amount  of  blood  retained  in  the  brush 
samples. 
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To  avoid  such  bias,  we  ran  a  K-means  clustering  analysis  within  each  sample  cluster,  and  set  the 
cluster  number  to  50  to  obtain  an  average  of  about  100  gene  probes  per  cluster  (Figs.  5  and  6). 
We  predicted  that  the  important  differentially  regulated  genes  should  be  consistently  up  or  down 
within  each  sample  cluster,  and  we  selected  only  the  gene  clusters  that  are  constantly  up-  or 
down-regulated  across  the  samples  (i.e.,  gene  clusters  28,  29  and  40  in  Cl;  gene  cluster  1,10, 
11,  46  and  47  in  C2)  for  further  analysis  as  genes  of  interest.  As  shown  in  Table  1  and  2,  these 
genes  of  interest  are  disparate  between  the  two  sample  clusters  with  slight  overlap  and  an  inverse 
correlation  in  certain  gene  clusters.  These  results  indicate  that  either  the  patients  from  these  two 
clusters  have  a  distinct  genetic  make-up  or  samples  from  one  of  the  clusters  are  simply  false 
positive,  providing  a  practical  contrast  for  the  other. 


Cl  top  up-  and  down-regulated  genes 
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Figure  4.  Top  up-  and  down-regulated  genes  in  the  two  sample  clusters.  The  gene 
expression  ratios  in  each  sample  cluster  were  averaged  for  each  gene,  and  then  sorted  in 

HpkQr.pnrlinn  nrrlpr 
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Figure  5.  The  centroid  view  of  the  50  clusters  resolved  with  k-means  clustering  analysis  for 
sample  cluster  Cl.  Gene  clusters  28,  29  and  40,  which  show  consistent  change  (down-regulated) 
across  the  samples  were  chosen  (highlighted). 

As  the  selected  gene  clusters  in  C2  consist  of  several  genes  that  were  previously  reported  to  be 
relevant  to  tumorigenesis,  we  extracted  these  genes  and  analyzed  the  potential  key  signal 
pathways  that  are  involved  with  a  Kegg  pathway  analysis  program.  The  pathway  analysis 
reveals  that  the  cytokine-cytokine  receptor  pathway  is  the  top  signal  pathway  involved,  followed 
by  other  tumorigenesis  pathways,  such  as  cell  migration  and  adhesion  (Table  3).  Due  to  our 
previous  suspicions  regarding  false-positive  Cl  samples,  we  used  these  two  clusters  to  run  a 
differential  expression  analysis  with  GEPAS  (Gene  Expression  Pattern  Analysis  Suite)  program 
(Fig.  7).  Genes  in  the  top  and  bottom  list  encompass  numerous  genes  that  were  previously 
reported  to  be  important  for  tumorigenesis.  Results  further  indicate  that  the  two  sample  clusters 
represent  two  distinct  populations;  thus,  the  two  sample  sets  need  to  be  analyzed  separately  to 
uncover  additional  information  useful  to  future  studies.  Microarray  analysis  of  the  sample 
clusters  may  provide  important  information  for  identifying  the  molecular  differences  between  the 
abnormal  and  normal  tissue  at  early  stage  of  tumorigenesis  in  future  studies. 
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Figure  6.  The  centroid  view  of  the  50  clusters  resolved  with  k-means  clustering  analysis  for 
sample  cluster  C2.  Gene  clusters  1,  10,  46,  47  (up-regulated),  and  11  (down-regulated)  which  show 
consistent  change  across  the  samples  were  chosen  (highlighted). 


In  conclusion,  whole  genome  expression  microarray  analysis  provides  tremendous  information 
about  the  molecular  differences  between  abnormal  and  normal  tissues.  However,  the  real  signal 
that  dictates  the  pathological  event  may  be  muffled  by  the  noises  from  mixed  sample 
population.  By  distinguishing  the  subsets  of  the  sample  population,  we  were  able  to  identify  the 
consistently  regulated  genes  and  the  signaling  pathway  that  is  critical  for  tumorigenesis. 
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PTPN7 

0.546 

URP2 

0.264 

RAB32 

0.461 

RIN3 

0.506 

SH2B3 

0.413 

SIGLEC9 

0.412 

SNAI3 

0.577 

TGFB1 

0.541 

TLR4 

0.404 

TMEM140 

0.555 

TNFAIP8L2 

0.470 

TNFSF13B 

0.496 

TRPM2 

0.481 

TSPAN33 

0.476 

UNC13D 

0.389 

VASP 

0.442 

Table  1.  The  gene  clusters  showing  consistent  regulation  of  expression  in  sample  cluster  Cl. 

Fold  change  is  the  average  relative  expression  level  of  all  samples  in  Cl . 
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Clusterl 

F-C 

Clusterl  0 

F-C 

Clusterl 1 

F-C 

Cluster46 

F-C 

Cluster47 

F-C 

ACP5 

2.41 

CCL2 

2.17 

ACTG2 

0.33 

AATK 

2.94 

ADAMTSL4 

3.19 

ADCY7 

3.54 

CCL3 

2.25 

COL6A2 

0.38 

AIF1 

2.27 

ADCY7 

3.54 

ADFP 

2.74 

CCL4 

2.25 

DES 

0.21 

AMICA1 

2.12 

AGRP 

3.50 

ADM 

2.69 

CCRL2 

3.16 

FLJ30901 

0.47 

AOAH 

2.09 

ALOX5AP 

3.53 

ALOX5 

2.72 

CXCL5 

3.78 

GREM2 

0.50 

AP1S2 

3.09 

AOC3 

2.76 

APOC2 

2.40 

CXCR4 

3.26 

MFAP4 

0.40 

APBB1IP 

2.36 

APOBEC3A 

5.82 

APOE 

2.76 

FCAR 

1.72 

MYH11 

0.24 

APOBEC3B 

2.67 

APOC1 

4.48 

C1QA 

1.96 

G0S2 

1.55 

PDLIM3 

0.29 

ARHGAP15 

2.30 

AQP9 

4.19 

C1QB 

2.25 

ILIA 

3.08 

TBX2 

0.28 

ARL11 

2.22 

BCAT1 

3.70 

C1QC 

2.17 

IL8 

1.58 

WISP2 

0.35 

BIN2 

2.07 

BCL2A1 

4.84 

C8A 

3.05 

MMP12 

3.06 

BTK 

3.20 

CA2 

5.06 

CATSPER1 

3.07 

ORM1 

1.59 

CCR7 

2.19 

CAMP 

4.20 

CCL3L3 

2.51 

OSM 

2.36 

CD300LF 

2.36 

CARD 12 

5.04 

CCRL2 

3.16 

PHACTR1 

2.12 

CD33 

2.78 

CCL18 

3.74 

CD109 

3.15 

PLA2G7 

3.09 

CD48 

2.40 

CCL23 

3.76 

CD274 

2.64 

SDS 

2.36 

CD53 

3.14 

CCR1 

3.20 

CD300C 

2.02 

SLC26A4 

1.74 

CLEC4A 

2.57 

CD163 

4.56 

CD44 

2.06 

SPINK1 

2.34 

COTL1 

2.22 

CD36 

3.10 

CD68 

2.54 

SPRR2C 

2.86 

CPM 

3.62 

CD52 

3.07 

CD69 

2.49 

TKTL1 

2.47 

CPVL 

2.29 

CLEC12A 

2.99 

CD83 

2.48 

CREB5 

2.66 

CLEC4D 

3.53 

CD86 

3.17 

CRT  AM 

3.21 

CLEC4E 

4.12 

CPM 

3.62 

CSF2RA 

2.42 

COLEC12 

4.07 

CXCL5 

3.78 

CXCL1 1 

3.97 

EBI2 

2.77 

DAB2 

2.39 

CYBB 

3.47 

EREG 

4.21 

DOCK1 1 

2.93 

DPEP2 

2.02 

FABP4 

3.98 

DOK2 

2.24 

EMR1 

3.61 

FGR 

3.99 

EMILIN2 

1.96 

EVI2B 

2.39 

FN1 

2.65 

EMP1 

2.26 

FCGR2A 

2.89 

FPRL1 

3.48 

FABP5 

2.13 

FCGR3A 

4.00 

GPA33 

4.62 

FAM89A 

2.65 

FCGR3B 

3.28 

GPNMB 

3.84 

FBP1 

2.14 

FCN1 

2.29 

HK3 

4.10 

FCER1G 

2.38 

FLI1 

2.45 

HP 

3.04 

FGR 

3.99 

FPR1 

2.89 

HPR 

3.25 

FLJ25416 

2.40 

FYB 

2.46 

HTRA4 

3.45 

HAVCR2 

2.98 

GLIPR1 

2.49 

ILIA 

2.99 

HMOX1 

2.72 

GLT1D1 

2.71 

IL1B 

3.08 

IFI30 

2.88 

GMFG 

2.77 

IL1RN 

3.51 

IL10RB 

2.60 

GNG2 

2.20 

INHBA 

4.36 

IRF8 

3.02 

GPR34 

2.16 

KIAA1212 

3.60 

ITGAX 

2.37 

GPR65 

3.32 

LPL 

2.95 

JAKMIP2 

2.67 

HCK 

2.58 

MARCO 

4.72 

KCNAB1 

2.68 

HK3 

3.04 

MCEMP1 

3.83 

KIAA1212 

2.06 

IGSF6 

3.31 

MMP19 

6.19 

KMO 

2.50 

IL8RA 

1.94 

MMP9 

4.69 

LAIR1 

2.48 

IL8RB 

2.31 

MRC1L1 

4.70 

LAPTM5 

2.83 

ITGAM 

3.24 

MS4A4A 

4.07 

LILRB3 

2.90 

ITGB2 

3.30 

MS4A7 

3.27 

LY86 

2.56 

LAPTM5 

2.83 

MSR1 

3.82 

MYOIG 

2.05 

LAT2 

2.33 

OLR1 

4.96 

NPL 

2.38 

LCP1 

2.88 

PCOLCE2 

4.11 

OASL 

3.00 

LILRA2 

2.76 

PLAUR 

2.94 

OSCAR 

2.56 

LILRB2 

2.25 

PLEK 

3.24 

PAG1 

2.35 

LRRC25 

2.77 

PPARG 

3.35 

PARVB 

1.96 

LST1 

2.18 

PRG1 

2.94 

PBEF1 

2.16 

LTB 

2.13 

PROK2 

3.96 

PHLDA1 

2.10 

LY96 

2.82 

RETN 

3.09 

PLAU 

2.69 

MFNG 

2.09 

SLA 

3.70 

PTPRO 

2.75 

MMD 

2.54 

SLC11A1 

3.13 

RAB8B 

2.32 

MME 

3.47 

SLC2A14 

3.10 

RBP4 

2.68 

MNDA 

3.42 

SNX10 

4.55 

SAMSN1 

2.82 

MPEG1 

2.12 

SPP1 

4.58 

SAPS1 

2.32 

MPP1 

2.89 

SPRR1A 

5.84 

SCD 

2.45 

MS4A6A 

2.11 

SPRR3 

8.51 

SLAMF8 

2.43 

NCF1 

2.47 

STAC 

3.65 

SLC11A1 

3.13 

NCF4 

2.50 

TEX14 

2.98 

SLC02B1 

3.09 

PSCDBP 

2.21 

TM7SF4 

3.32 

TDRD9 

2.00 

PTCRA 

2.29 

TNFAIP6 

4.13 

THBD 

1.97 

PTGS1 

2.08 

TREM1 

3.55 

TIMP2 

2.08 

PTPRC 

2.35 

TUBB6 

3.48 

TLR4 

2.34 

QKI 

2.68 

VSIG4 

3.44 

TNF 

2.31 

RCSD1 

2.35 

W60781 

4.11 

TRPV2 

2.39 

RGS2 

2.68 

ZFHX1B 

3.49 

TYROBP 

2.65 

S100A8 

4.38 

UPP1 

2.87 

S100A9 

2.76 

VIM 

2.29 

SIGLEC5 

2.51 

VMD2 

2.24 

SLC24A4 

2.59 

TAGAP  2.26 

TBXAS1  2.15 

TLR8  3.53 

TREML2  2.45 

TUBB3  2.21 

WASPIP  2.36 


Table  2.  The  gene  clusters  showing  consistent  regulation  of  expression  in  sample  cluster  C2. 
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Pathway 

genes 

Cytokine-cytokine  receptor  interaction 

CCR1,  CCR7,  CSF2RA,  ILIA,  IL1B,  IL8,  IL8RA, 

IL8RB,  IL10RB,  INHBA,  LTB,  OSM,  CCL2,  CCL3, 

CCL4,  CCL18,  CCL23,  AL0X5,  PTGS1 ,  TBXAS1 . 

Toll-like  receptor  signaling  pathway 

LY96,  IL1B,  IL8,  TLR8,  CCL3,  CCL4,  CXCL11,  SPP1, 
TLR4,  TNFA,  CD86 

Leukocyte  transendothelial  migration 

CYBB,  ITGAM,  ITGB2,  MMP9,  NCF4,  NCF1,  CXCR4 

Cell  adhesion  molecules  (CAMs) 

CD274,  ITGAM,  ITGB2,  PTPRC,  CD86 

ECM-receptor  interaction 

COL6A2,  FN1 ,  SPP1 ,  CD36,  CD44 

Regulation  of  actin  cytoskeleton 

FN1,  ITGAM,  ITGAX,  ITGB2 

Focal  adhesion 

COL6A2,  FN1,  PARVB,  SPP1 

Arachidonic  acid  metabolism 

AL0X5,  PTGS1 ,  TBXAS1 

Table  3.  The  major  canonical  pathways 
sample  cluster  C2  are  involved. 

in  which  the  constantly  up-regulated  genes  from 

statistic 
7 . 3234143257 
6-7664794922 
6-5562791824 
6-3351411819 
5 -9572329521 
5 -9376091957 
5 -8323073387 
5-7647166252 
5-7361717224 
5 -7274522781 
5-7251210213 
5-6843090057 
5-6471033096 
5 .6288752556 
5 .6263937950 
5 .5883936882 
5-5694446564 
5-5421891212 
5-5278983116 
5 .5275263786 
5-5044445992 
5.4818964005 
5 .4634337425 
5.4119162560 
5.3929581642 
5.2870187759 
5 .2857708931 
5 -2763895988 
5 .2487106323 
5.2219042778 
5.2161178589 
5 .2037758827 
5.1050877571 
5 -0887007713 
5 .0348491669 
5 .0271506310 
4 .9678120613 
4 .9524722099 
4 .9237895012 
4 .9194140434 
4 .9154000282 
4 .9138512611 
4 .9133400917 
4 .9036750793 
4 .8973913193 
4 .8924212456 
4 .8860707283 
4 .8811893463 
4 .8601655960 
4 .8511595726 
-4.5730895996 
-4.5732049942 
-4.5746822357 
-4  .5858492851 
-4.6565752029 
-4.6713910103 
-4.6824464798 
-4.6990218163 
-4 .6993103027 
-4.7128019333 
-4.7553620338 
-4  .7739839554 
-4.7794003487 
-4.7983784676 
-4.8068699837 
-4 .8153424263 
-4 .8294210434 
-4  .8516631126 
-4 .8659214973 
-4 .8783845901 
-4  .9086585045 
-4.9307947159 
-4.9345946312 
-4.9609160423 
-5.0176310539 
-5 .0236759186 
-5.0415902138 
-5.0680341721 
-5  .1780481339 
-5  .1853256226 
-5.2295022011 
-5.3874702454 
-5  .5843539238 
-5  .6658916473 
-6 .0589261055 
-6.2066740990 
-6 .3766674995 
-6  .4928703308 
-6  .7279086113 
-6 .7852354050 
-6  .7939572334 
-7 .0575137138 
-7 .0911927223 
-7 .1246404648 
-7.3224148750 
-9.8592510223 
-13 .0044622421 
-13 .6741333008 
-17  .1544799805 
-19 .2854194641 


FLJ 40432 
TM6SF1 
DOK3 
CKLF 

CR611712 

SERPINB8 

ARL11 

MOSPD2 

JARID2 

MGC23909 

TNFSF13B 

CXCL11 

SYK 

RIT1 

SYK 

BID 

TNFAIP6 

VASP 

ZCSL2 

CCDC109B 

CD58 

ENST 00000370395 

MICB 

SLAMF8 

SYK 

BID 

PSMA6 

INCA 

ClOorf 125 

ClOorf 125 

OSBPL8 

CD69 

PTAFR 

LILRB5 

FFAR2 

PAG1 

NET02 

SELPLG 

BF213738 

OSTM1 

CLEC4A 

GPX1 

I  COS 

CYBB 

SYK 

PTAFR 

FAN49B 

URP2 

C20orf 24 

ENST 00000309178 

AA495894 

EGFL9 

HMCN1 

CCND1 

THC2343678 

FOXJ1 

WNT5B 

A_24_P174353 

CCDC8 

DMRTC1 

GLI4 

ABCC6 

DFNB31 

WFDC6 

C9orf 98 

FBX017 

LRRC61 

TMPRSS3 

CX3CL1 

KIAA1602 

SIRT3 

ENST 00000382327 

SELM 

DERL3 

SERPINF1 

ZBTB20 

AF339799 

WNK2 

ABCA3 

ENST 00000349637 

C1QTNF5 

FMOD 

FLJ 12993 

RND2 

NFIX 

C7orf 26 

FXYD1 

ERBB2 

L0C400451 

GPR98 

TNKS1BP1 

GDPD5 

LTBP3 

FLJ 42875 

THC2283783 

TGFB3 

ITGA3 

ATOH8 

ENST 00000307662 
THC2403217 


Figure  7.  The  top  and  bottom 
differentially  expressed  genes 
between  the  sample  cluster  Cl  and 
C2.  The  analysis  and  heat  map  are 
performed  and  generated  with 
GEPAS  (Gene  Expression  Pattern 
Analysis  Suite)  program. 


The  sea 1 e  is  presented  in  si gma  un i ts . 
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Aim  2  Establish  an  organotypic  model  system  that  mimics  in  vivo  interactions  between 
normal,  premalignant,  and  malignant  bronchial  epithelial  cells  in  the  lung  using 
cells  derived  from  bronchial  biopsies  and  immortalized  bronchial  cells. 

Our  prior  studies  using  chromosome  in  situ  hybridization  to  visualize  genetic  changes  in  the 
bronchial  epithelium  of  current  and  former  smokers  suggested  that,  over  years  of  tobacco 
smoke  exposure,  the  combination  of  accumulating  genetic  damage,  ongoing  tissue  damage, 
and  wound  healing  results  in  a  mosaic  of  evolving  clonal  outgrowths  throughout  the  bronchial 
epithelium.  To  better  understand  the  molecular  basis  of  preferential  outgrowth  of  more 
advanced  bronchial  epithelial  clones,  we  proposed  to  utilize  a  cell  culture  model  whereby 
normal  and  abnormal  bronchial  epithelial  cells  are  grown  on  collagen  or  stromal  cell-coated, 
suspended  filters  and  exposed  to  an  air-liquid  interface.  This  organotypic  culture  environment 
mimics  lung  stratified  epithelium,  complete  with  basal  cells,  ciliated  columnar  cells,  and  mucus- 
producing  goblet  cells.  Our  group  has  extended  this  model  system  by  tagging  cell  populations 
with  fluorescent  probes  (e.g.,  green  fluorescence  protein,  or  GFP)  that  allows  us  to  carry  out  live 
cell  imaging  of  mixed  clonal  populations.  This  model  system  permits  characterization  of  the 
ability  of  more  advanced  bronchial  epithelial  cell  populations  to  expand  on  the  growth  surface  at 
the  expense  of  less  advanced  bronchial  epithelial  cell  populations. 

Summary  of  Research  Findings 

Our  long-term  plan  associated  with  this  Specific  Aim  was  to  obtain  biopsy  specimens  from 
individuals  participating  in  the  clinical  trial  of  Project  1  and  to  compare  the  differential  growth 
properties  in  organotypic  cultures  of  bronchial  epithelial  cells  derived  from  LIFE  bronchoscopy 
positive  regions  to  specimens  derived  from  negative  regions.  Because  of  the  delay  in  the 
initiation  of  the  clinical  trial  and  the  limited  number  of  participants  in  the  trial,  we  initially  focused 
on  characterization  of  the  in  vitro,  organotypic  culture  growth  patterns  of  a  group  of  established 
bronchial  epithelial  cell  types,  including  commercially  available  normal  bronchial  epithelial  cells; 
cells  from  bronchial  biopsies  that  have  been  immortalized  (HCC-BE  cells)  with  transfection  of 
cyclin-dependent  kinase  4  (cdk4)  and  telomerase  (h-TERT)  by  our  colleagues,  Drs.  Minna  and 
Shay;  and  bronchial  epithelial  cells  derived  from  Ad12/SV40-transfected  bronchial  epithelial 
cells  (BEAS  2B)  and  subsequently  immortalized  in  vivo  (1799),  transformed  with  carcinogens 
(1 1 98),  and  then  selected  for  a  tumorigenic  phenotype  (1 1 70). 

In  previous  studies,  we  transfected  different  cell  lines  with  differing  living  color  probes  (e.g.,  GFP, 
RFP,  and  CFP)  and  characterized  their  relative  ability  to  grow  and  expand  at  the  expense  of 
their  competitor  cell  populations.  Initial  studies  used  fluorescence  microscopy  to  delineate  the 
spatial  domains  occupied  by  the  competing  cell  populations,  and  laser  scanning  microscopy 
was  used  to  determine  the  manner  in  which  one  cell  population  could  expand  at  the  expense  of 
its  neighboring  population.  More  recently,  we  examined  the  interface  dynamics  between 
competing  cell  populations  using  live  cell  imaging  and  were  able  to  directly  observe,  in  real  time, 
the  manner  in  which  one  cell  population  advances  at  the  expense  of  its  neighboring  colony. 

One  problem  that  we  face  in  carrying  out  the  proposed  studies  with  fresh  bronchial  cell 
outgrowths  (obtained  from  biopsies  from  different  lung  sites  from  the  same  individual  and 
initiated  in  culture  by  Dr.  Koo’s  laboratory)  is  that  the  cells  will  not  contain  living  color  markers. 
Such  cells  are  notoriously  difficult  to  transfect  and  not  amenable  to  selection  using  antibiotic 
treatments.  We  therefore  initiated  the  development  of  lentiviral  infection  technologies  that  would 
provide  high  infection  rates  and  the  ability  to  express  genes  in  slowly  growing  cell  populations. 
Lentiviral  constructs  have  now  been  developed  that  will  allow  us  to  label  bronchial  epithelial 
cells  with  GFP,  YFP,  monoRFP,  or  CFP  in  freshly  generated  cultures.  We  have  acquired  a  new 
type  of  color  imaging  vector  that  contains  copies  of  red,  cherry,  YFP  and  blue  genes  on  the 
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same  vector  and  that  contain  lox  recombination  sequences  between  the  different  color 
sequences.  Transfection  of  cells  containing  this  multicistronic  vector  with  the  gene  for  the  ere 
recombinase  leads  to  recombination  and  the  generation  of  cells  with  differing  colors  (Fig  8). 


Figure  8.  Development  of 
multicolor  cell  populations  using 
the  Brainbow  vector  1.1.  The  cells 
are  first  stably  transfected  with  the 
Brainbow  vector  1.1  and  then 
transfected  with  Cre  recombinase  to 
generate  heterogeneity  of  color 
within  the  cell  population.  This 
approach  will  permit  direct,  real-time 
visualization  of  preferential 
subclonal  expansion  within  the  cell 
population. 


Outgrowths  of  the  resulting  subclones  can  then  be  visualized  and  quantified  using  our  spectral 
imaging  camera,  which  characterizes  the  spectral  characteristics  of  each  picture  element  in  the 
image  and  allows  us  to  distinguish  groups  of  cells  differing  in  color.  We  are  hopeful  this  new 
approach  will  allow  us  to  identify  and  selectively  isolate  more  aggressive  subpopulations  that 
exist  within  heterogeneous  cell  populations  and  characterize  the  specific  genetic  and  expression 
changes  associated  with  their  more  aggressive  behavior. 

Aim  3  Determine  the  mechanisms  of  genetic  instability  and  elucidate  the  signaling 
pathways  associated  with  clonal  outgrowth  of  premalignant  and  malignant 
bronchial  epithelial  cells  using  the  organotypic  model  system. 

Our  prior  studies  using  chromosome  in  situ  hybridization  to  visualize  genetic  changes  in  the 
bronchial  epithelium  of  current  and  former  smokers  suggested  that  current  tobacco  exposure 
was  associated  with  increased  levels  of  ongoing  genetic  instability  (i.e.,  chromosome 
polysomy).  Upon  smoking  cessation,  while  the  initiated  clonal  outgrowths  appeared  to  be 
maintained  over  tens  of  years,  the  levels  of  ongoing  genetic  instability  appeared  to  decrease 
gradually  during  the  first  year  following  smoking  cessation.  However,  in  some  cases,  we 
observed  evidence  for  ongoing  genetic  instability  in  the  bronchial  epithelial  cells  even  10-20 
years  following  smoking  cessation.  Since  nearly  half  of  the  newly  diagnosed  lung  cancer  cases 
occur  in  former  smokers,  we  felt  that  this  finding  suggested  that  an  ongoing  intrinsic  process  of 
genetic  instability  might  exist  in  the  lungs  of  some  former  smokers  that  drives  continued  genetic 
evolution  toward  lung  cancer  even  after  cessation  of  extrinsic  carcinogenic  exposure. 

Our  working  hypothesis  is  that  years  of  tobacco  exposure  induces  a  chronic  damage  and  wound 
healing  cycle  that  results  both  in  the  accumulation  of  genetic  alterations  in  the  epithelial  cells 
that  influences  both  chromosome  stability  mechanisms  (e.g.,  loss  of  cell  cycle  checkpoint  and 
cell  loss  mechanisms  through  loss  of  pi 6  expression,  p53  mutations,  cyclin  D1  overexpression, 
etc)  and  creates  a  poor  growth  environment  (e.g.,  altered  stromal  signals).  The  goal  of  this 
specific  aim  was  to  utilize  the  lung  organotypic  model  to  address  this  hypothesis  in  vitro  utilizing 
bronchial  epithelial  cells  derived  from  LIFE  bronchoscopically  identified  “abnormal”  and  “normal” 
regions  of  the  lung  of  current  and  former  smokers  participating  in  the  clinical  trial  of  Project  1 . 
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Summary  of  Research  Findings 

Previous  data  suggested  that  ongoing  genetic  instability  could  be  detected  in  the  bronchial 
epithelial  cell  populations  of 
both  current  and  former 
smokers  leading  to  the 
development  of  subclonal 
outgrowths.  When  grown  in 
three-dimensional,  lung 
organotypic  cell  cultures,  we 
found  that  bronchial 
epithelial  cells  exhibited 
genetic  instability  in  the  form 
of  errors  at  mitosis,  including 
chromosome  breaks,  lagging 
chromosome  fragments  at  anaphase, 
chromosome  bridges,  multipolar 
spindles,  and  development  of  bi-  and 
trinucleate  cells  due  to  failed 

completion  of  cell  separation  following 
mitosis  (Fig.  9).  We  noted  that  the 
levels  of  genetic  instability  increased  when  cells  proliferate  away  from  the  basal  layer, 
suggesting  dysregulation  of  cell  cycle  control  when  cells  try  to  proliferate  in  inappropriate  spatial 
regions.  We  also  showed  that  mitotic  instability  was  associated  with  increased  expression  of 
stress-response  proteins,  including  phosphorylated  histone  H2AX. 


Recent  data  suggests  that  genetic 
instability  can  occur  when  cells  try  to 
undergo  mitosis  without  proper  spatial 
directionality;  the  orientation  of  mitosis 
may  be  regulated  by  its  local  physical 
environment.  In  our  organotypic  culture 
system,  we  noted  that  normal  bronchial 
epithelial  cells  preferentially  undergo 
mitosis  at  the  basal  layer  of  the  culture 
and  the  orientation  of  the  mitotic 
apparatus  is  generally  parallel  to  the 
basal  layer,  perhaps  associated  with  the 
attachment  to  the  substratum  (Fig.  10).  However,  we  noted  that  cells  further  along  the  multi-step 
tumorigenesis  pathway  tended  to  divide  more  frequently  away  from  the  basement  membrane 
and  their  mitotic  spindle  orientation  relative  to  the  plane  of  the  basal  layer  was  highly  variable 
(Fig.  11). 


1 

j 

Figure  10.  Example  of  a  normal  human  bronchial 
epithelial  cell  in  anaphase  with  its  spindle  orientation 
parallel  to  the  plane  of  the  basal  layer 

Figure  9.  Examples  of  ongoing  genetic  instability  in 
bronchial  epithelial  cells  in  organotypic  cultures.  The  figure 
on  the  left  shows  an  anaphase  cell  containing  a  bridge  and  a 
lagging  fragment.  The  figure  on  the  right  shows  a  tripolar 
anaphase.  The  mitotic  figures  are  stained  with  propidium  iodide, 
which  gives  a  red-orange  fluorescence. 


Hypotenuse 


^  Basal  layer 


Figure  11.  Example  of  an  immortalized  human  bronchial 
epithelial  cell  in  anaphase  with  its  spindle  orientation  at 
an  angle  of  dz  relative  to  the  plane  of  the  basal  layer. 


To  better  quantify  this  abnormal  phenomenon,  we  quantitatively 
analyzed  images  of  the  three  dimensional  cultures  to  determine 
the  relationship  between  changes  in  mitotic  orientation  relative 
to  the  basal  layer  plane  and  the  frequencies  of  mitotic  errors 
(Fig.  12).  We  found  that  as  the  cultures  fashioned  toward  the 


32 


Army  Award  W81XWH-04-1-0142;  Waun  Ki  Hong,  M.D. 

Annual  Report:  Reporting  Period  15  December  2007  -  14  December  2008 


HCCICombined 


HCC2Combined 


!8 


J _ •  _ 

•  •  •  * 

_ 

« 

• 

i* 

S» 

• 

tm 

• 

•• 

< 

* .  *• 

) 

•  « 

l 

•  •  —* 

•  •  • 

• 

•  * 

• 

» • 

• 

• 

• 

•  * 

•  • 

•#  • 

••  •• 

• 

BEAS-2B 


• 

• 

• 

• 

• 

• 

• 

• 

• 

* 

». 

•  i 

< 

► 

i 

•  * 

• 

« 

• 

• 

•  • 
•  • 

• 

•• 

1799 


1198 


• 

• 

_ • _ 

• 

« 

L  « 

• 

• 

►  .. 

4 

•  C 
• 

• 

• 

•  •< 
*• 

•! 

• 

• 

v* 

• 

r v* 

• 

• 

•  V  1 
•  •••  • 

\  * 

t-c*  i 

m 


\ — 


Figure  13.  Graphical  representation  of  the  relationship  between  the  distance  of  the  anaphase  of  the  anaphase 
from  the  basal  layer,  the  orientation  of  the  mitotic  spindle  relative  to  the  plane  of  the  basal  layer,  and  the 
presence  of  mitotic  abnormalities  in  bronchial  epithelial  cells  at  various  stages  of  the  multistep  tumorigenesis 
process  cultured  in  organotypic  cultures.  HCC  1, 2  and  3  cells  represent  normal  human  bronchial  epithelial  (NHBE) 
cells  that  have  been  immortalized  with  cdk4  and  hTERT.  Beas  2B  cells  represent  NHBE  cells  that  have  been 
immortalized  with  SV40  large  T  antigen.  1799  and  1198,  cells  represent  BEAS  2B  cells  that  have  been  treated  with 
carcinogen,  transformed,  and  further  along  the  multiistep  tumorigenesis  pathway.  11701  cells  are  BEAS  2B  cells  that 
have  been  treated  with  carcinogen  and  have  become  tumorigenic.  The  symbols  in  the  figures  are  same  as  those 
described  in  the  legend  of  Figure  12. 
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tumor  phenotype,  they  more  frequently  exhibited  mitoses  away  from  the  basal  layer  and  the 
orientation  angle  of  the  mitotic  spindle  (and  degree  of  mitotic  error)  increased  with  distance  from 
the  basal  layer  (Figs.  13  and  14).  This  result  supports  the  hypothesis  that  mitotic  directionality 
and  fidelity  depend  on  cell-cell  and  cell-substrate  interactions.  Cells  that  undergo  mitosis  in  an 
inappropriate  physical  context  may  improperly  orient 


their  mitotic  spindles  and  result  in  mitotic  error  and 
increased  levels  of  genetic  instability  (Fig  14).  We 
therefore  postulate  that  re-regulating  the  spatial 
location  of  mitotic  events  in  the  bronchial  epithelium 
may  decrease  the  rate  of  ongoing  genetic  instability  and  perhaps  delay  cancer  onset  in 
individuals  with  increased  lung  cancer  risk.  This  organotypic  culture  model  might  be  useful  in 
the  detection  of  potential  chemopreventive  agents  that  can  reregulate  the  spatial  patterns  of 
mitotic  events  in  the  bronchial  epithelium. 


NHBE 


Figure  12.  Graphical  representation  of  the 
relationship  between  the  distance  of  the 
anaphase  of  the  anaphase  from  the  basal 
layer,  the  orientation  of  the  mitotic  spindle 
relative  to  the  plane  of  the  basal  layer,  and 
the  presence  of  mitotic  abnormalities  in 
normal  human  bronchial  epithelial  cells  in 
an  organotypic  culture.  The  Y  axis  represents 
the  distance  of  the  anaphase  from  the  basal 
layer.  The  color  of  the  symbol  represents  the 
absence  (blue)  or  presence  of  an  abnormal 
anaphase,  including  lagging  chromosomes 
(red),  chromosome  bridges  (green),  or 
multipolar  spindles  (purple).  The  size  of  the 
symbol  represents  the  angle  of  the  mitotic 
spindle  relative  to  the  plane  of  the  basal  layer. 
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Figure  14. 
Relationship 
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the  relative 
angle  of  the 
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Aim  4  Characterize  the  impact  of  chemopreventive  and/or  chemotherapeutic  agents  on 
early  lung  tumorigenesis  events  in  reconstructed  bronchial  epithelium  and  in  the 
bronchial  biopsies  of  subjects  entered  onto  the  clinical  trials  in  Project  1. 

The  goals  of  the  first  three  specific  aims  of  this  project  are  essentially  to  develop  and  utilize  the 
lung  organotypic  culture  model  to  identify  the  factors  that  control  ongoing  clonal  expansion  and 
genetic  instability  in  the  lungs  of  current  and  former  smokers.  The  idea  behind  this  fourth 
specific  aim  is  to  integrate  the  information  garnered  from  the  first  three  specific  aims  to  identify 
targeted  strategies  to  slow  preferential  outgrowth  of  more  advanced  bronchial  epithelial  cells 
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and  to  decrease  the  levels  of  ongoing  genetic  instability.  We  also  proposed  to  determine 
whether  treatment  of  these  organotypic  cultures  with  the  chemopreventve  agents  used  in  the 
clinical  trial  of  Project  1  would  slow  these  aberrant  properties  in  vitro  and  whether  results 
obtained  in  the  organotypic  culture  model  reflected  that  seen  in  the  lungs  of  the  participants  in 
the  clinical  trial. 

Summary  of  Research  Findings 

This  aspect  of  the  project  is  still  in  development.  We  have  generated  lentiviral  vectors  for 
inducing  living  color  probes  in  primary  bronchial  epithelial  cells  to  examine  clonal  outgrowths. 
We  have  also  generated  lentiviral  vectors  for  color-marked  histone  H2B  that  will  permit  us  to 
directly  monitor  mitotic  fidelity  in  real  time  in  organotypic  cultures  of  bronchial  epithelial  cells, 
and  have  developed  lentiviral  vectors  containing  either  color-marked  cyclin  Dla  or  cyclin  Dlb 
that  will  allow  us  to  determine  the  relative  impact  of  cyclin  D1  isoform  overexpression  on  genetic 
instability.  Parallel  studies  in  immortalized  oral  epithelial  cells  have  demonstrated  that  cyclin  D1 
overexpression  induces  various  types  of  mitotic  instability  including  increased  frequencies  of 
chromosome  bridges  and  lagging  chromosome  fragments  and  generation  of  binucleate  cells 
associated  with  incomplete  cell  separation  at  the  end  of  mitosis.  Dr.  Koo  has  already 
established  more  than  14  pairs  of  cell  strains  derived  from  bronchial  biopsies  of  fluorescence- 
normal  and  -abnormal  regions  of  participants  in  the  prevention  trial  of  Project  1 .  These  strains 
are  being  stored  in  the  frozen  state,  and  further  studies  will  be  undertaken  after  appropriate 
lentiviral  particles  are  generated  for  infection  of  these  cells. 

Aim  5.  Identify  gene  expression  signatures  in  epithelial  cells  detected  by  LIFE 
bronchoscopy  that  determine  aggressiveness. 

This  new  aim  under  the  ReVITALization  plan  will  be  performed  in  conjunction  with  Dr.  Wistuba, 
Core  C  Director.  Our  preliminary  results  from  our  research  in  VITAL  have  shown  that  epithelial 
cells  isolated  from  bronchial  biopsies  of  LIFE-abnormal  mucosa  can  be  characterized  as  more 
aggressive  (invasive  and  migratory)  than  those  of  LIFE-normal  mucosa.  Microarray  analysis 
suggested  that  several  CXCL-chemokine  signaling  pathways  are  mainly  deregulated  in  LIFE- 
abnormal  cells.  Moreover,  we  identified  that  pro-angiogenic  ELR+  (glutamic  acid,  lysine  and 
arginine  motif)  chemokines  were  strongly  upregulated  by  inflammatory  cytokines  in  lung  cancer 
cells. 

Summary  of  Research  Findings 

Inflammatory  cytokines  are  known  to  play  important  role  in  formation  of  new  blood  vessels.  The 
proinflammatory  cytokine  interleukin  (IL)-ip  has  been  reported  to  promote  tumor  development. 
We  reported  that  IL-ip  up-regulated  an  array  of  proangiogenic  CXC  chemokine  genes  in  the 
NSCLC  cell  line  A549  and  NCI-H1734,  as  determined  by  microarray  analysis.  Conditioned 
medium  from  IL-1  p-treated  A549  and  NCI-H1734  cells  markedly  increased  endothelial  cell 
migration.  The  migration  was  completely  suppressed  by  neutralizing  antibodies  against  CXCL5 
and  CXCR2.  We  also  found  that  IL-ip-induced  CXC  chemokine  gene  overexpression  in 
NSCLC  cells  was  abrogated  with  the  knockdown  of  two  transcription  factors,  CREB  or  NF-kB. 
Moreover,  the  expression  of  the  CXC  chemokine  genes  as  well  as  CREB  and  NF-kB  activity 
was  greatly  increased  in  the  tumorigenic  NSCLC  cell  line  compared  with  normal,  premalignant 
immortalized  or  nontumorigenic  cell  lines.  A  disruptor  of  the  interaction  between  CREB-binding 
protein  and  transcription  factors  such  as  CREB  and  NF-kB,  2-naphthol-AS-E-phosphate  (KG- 
501),  inhibited  IL-ip-induced  CXC  chemokine  gene  expression  and  angiogenic  activity  in 
NSCLC.  Further  studies  revealed  that  KG-501  induced  cell  cycle  arrest  and  apoptosis  of  lung 
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cancer  cell  lines.  Based  on  these  findings,  we  proposed  that  targeting  CREB  or  NF-kB  using 
small-molecule  inhibitors,  such  as  KG-501,  holds  promise  as  a  preventive  and/or  therapeutic 
approach  for  lung  cancer. 

Genes  in  the  EGFR-MAPK  pathway  have  been  reported  as  being  abnormally  regulated  in 
abnormal  bronchial  epithelial  cells  obtained  from  bronchial  brush  specimens  guided  by  LIFE. 
During  the  development  of  squamous  cell  carcinoma  (SCC)  in  the  lung,  bronchial  epithelial  cells 
exhibit  a  progressive  series  of  morphologically  distinct  changes:  hyperplasia,  squamous 
metaplasia,  dysplasia,  carcinoma  in  situ,  and  finally  invasive  SCC.  Here,  we  investigated 
molecular  mechanisms  involved  in  the  initiation  of  the  abnormal  differentiation,  namely 
hyperplasia,  of  bronchial  epithelial  cells.  We  demonstrated  that  ErbBI  ligands,  including 
epidermal  growth  factor  (EGF),  TGF-alpha,  and  amphiregulin,  completely  disrupted  apical-basal 
polarity  and  induced  hyperplasia  of  normal  human  tracheobronchial  epithelial  (NHTBE)  cells. 
EGF-induced  hyperplasia  was  completely  blocked  by  an  EGFR  inhibitor,  erlotinib,  and  MEK1/2 
inhibitor,  U0126,  suggesting  involvement  of  MEK-ERK  signaling.  Further  studies  showed  that 
EGF  substantially  upregulated  cyclin  D1,  and  that  these  inhibitors  completely  blocked  the 
upregulation.  Promoter  analysis  of  cyclin  D1  revealed  that  Ap-1  transcription  factor  regulates  the 
overexpression  of  cyclin  D1.  Depletion  of  AP-1  component  c-Jun  using  siRNA  completely 
abrogated  EGF-induced  cyclin  D1  expression  and  also  inhibited  EGF-induced  hyperplasia  in 
NHTBE  cells.  In  conclusion,  we  showed  that  EGF  induced  hyperplasia  of  primary  bronchial 
epithelial  cells  and  AP-1  plays  a  crucial  role  in  lung  carcinogenesis.  The  results  were  published 
in  a  recent  issue  of  Cancer  Prevention  Research  (October  2008). 

Key  Research  Accomplishments 

•  Completed  processing  of  14  pairs  of  patient  samples  and  obtained  the  microarray  data 
sets. 

•  Identified  two  distinct  populations  among  patient  samples,  which  has  helped  us  to  avoid 
the  confounding  effect  of  a  mixed  sample  population  and  enabled  us  to  find  consensus 
in  gene  regulation  patterns. 

•  Identified  the  cytokine-cytokine  receptor  pathway  as  one  of  the  major  signal  pathways 
that  may  dictate  the  differences  between  the  white-light  or  LIFE  abnormal  tissue  and 
normal  tissue,  and  could  be  critical  in  the  early  development  of  lung  cancer. 

•  Located  panels  of  genes  that  are  unanimously  regulated  in  each  distinct  sample  cluster, 
and  found  that  several  of  these  genes  coincide  with  genes  previously  reported  to  be 
important  for  tumorigenesis. 

•  Demonstrated  that  cells  further  along  the  multi-step  tumorigenesis  pathway  tended  to 
divide  more  frequently  away  from  the  basement  membrane  and  their  mitotic  spindle 
orientation  relative  to  the  plane  of  the  basal  layer  was  highly  variable. 

•  Confirmed  that  as  cultures  fashioned  toward  the  tumor  phenotype,  they  more  frequently 
exhibited  mitosis  away  from  the  basal  layer  and  the  orientation  angle  of  the  mitotic 
spindle  (and  degree  of  mitotic  error)  increased  with  distance  from  the  basal  layer. 

•  KG501,  a  small  molecule  inhibitor  targeting  CREB  activity,  and  neutralizing  antibodies 
against  CXCL5  and  CXCR2  blocked  the  migration  of  vascular  endothelial  cells  induced 
by  inflammatory  cytokine. 

•  Demonstrated  that  EGFR  ligands  induce  hyperplasia  of  bronchial  cells  cultured  in  an 
organotypic  3-dimensional  culture  method. 

•  Demonstrated  that  cyclin  D1  upregulated  by  API  transcription  factor  plays  a  critical  role 
in  the  hyperplasia  of  the  bronchial  epithelial  cells  and  that  erlotinib  blocks  the 
hyperplasia. 
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Conclusion 

•  Cytokine-cytokine  receptor  pathway  was  identified  as  one  of  the  major  signal  pathways 
that  may  dictate  the  differences  between  the  white-light  or  LIFE  abnormal  tissue  and 
normal  tissue,  and  hence  could  play  a  critical  role  in  the  early  development  of  lung 
cancer. 

•  KG501,  a  small  molecule  inhibitor  targeting  CREB  activity,  and  neutralizing  antibodies 
against  CXCL5  and  CXCR2  hold  promise  as  preventive  and/or  a  therapeutic  strategy  for 
lung  cancer. 

•  Demonstrated  that  re-regulating  the  spatial  location  of  mitotic  events  in  the  bronchial 
epithelium  may  decrease  the  rate  of  ongoing  genetic  instability  and  perhaps  delay 
cancer  onset  in  individuals  with  increased  lung  cancer  risk. 

•  EGFR  ligands  induce  bronchial  hyperplasia,  which  can  be  inhibited  by  erlotinib. 


Project  4:  Modulation  of  Death  Receptor-Mediated  Apoptosis  for  Chemoprevention 

(Project  Leader  and  co-leader:  Shi-Yong  Sun,  Ph.D.;  Fadlo  R.  Khuri,  M.D.) 

The  objective  of  Project  4  is  to  understand  the  role  of  death  receptor  (DR)-mediated  apoptotic 
pathways  in  lung  carcinogenesis,  cancer  prevention,  and  therapy  in  order  to  develop 
mechanism-driven  combination  regimens  by  modulating  DR-mediated  apoptosis  for 
chemoprevention  and  therapy  of  lung  cancer.  Following  is  a  summary  of  our  research  progress: 

Aim  1:  To  determine  whether  decoy  receptor  (DcR)  and  tumor  necrosis  factor-related 
apoptosis-inducing  ligand  (TRAIL)  expression  are  reduced  or  lost  while  DR  remains 
largely  expressed  and  whether  procaspase-8  and  FLIP  expression  and  Akt  activity  are 
increased  during  lung  carcinogenesis. 

Summary  of  Research  Findings 

c-FLIP  is  the  major  inhibitor  of  death  receptor-mediated  extrinsic  apoptotic  pathway.  We 
determined  the  expression  patterns  of  c-FLIP  in  a  panel  of  cell  lines  including  immortalized 
normal,  premalignant  and  malignant  (i.e.,  non-small  cell  lung  carcinoma;  NSCLC)  bronchial 
epithelial  cell  lines.  Both  long  and  short  forms  of  c-FLIP  were  highly  expressed  in  all  of  the 
malignant  NSCLC  cell  lines  (i.e.,  1170-1,  HI 57,  A549,  HI 299,  HI 792,  and  HCC827)  and  the 
premalignant  line  1198  that  was  exposed  to  tobacco  carcinogens;  in  contrast,  the  normal  cell 
lines  (HBEC3KT  and  BEAS-2B)  and  the  premalignant  cell  line  1799  that  was  not  exposed  to 
tobacco  carcinogens  expressed  very  low  or  undetectable  levels  of  c-FLIP  (Fig.  1).  Death 
receptor  5  (DR5),  as  we  speculated,  is  expressed  in  both  normal  and  malignant  lung  cancer  cell 
lines.  Bcl-2,  a  major  inhibitor  of  the  mitochondria-mediated  intrinsic  apoptotic  pathway,  was  also 
expressed  in  normal  cell  lines  and  the  majority  of  NSCLC  cell  lines  (Fig.  1).  Together,  these 
results  suggested  that  c-FLIP  may  play  an  important  role  in  regulation  of  the  development  of 
lung  cancer  and  may  serve  as  a  therapeutic  target  for  lung  cancer. 
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Figure  1.  Detection  of  c-FLIP,  DR5  and  Bcl-2 
expression  in  normal  and  transformed  bronchial 
epithelial  cell  lines.  Whole  protein  lysates  were 
prepared  from  the  indicated  cell  lines  and  then 
subjected  to  detection  of  the  given  protein  by  Western 
blot  analysis.  NS,  non-specific  spot. 


We  thus  used  c-FLIP  small  interfering  RNA  (siRNA)  to  specifically  downregulate  the  expression 
levels  of  c-FLIP  (both  long  and  short  forms)  and  determined  its  impact  on  apoptosis. 
Knockdown  of  c-FLIP  substantially  induced  cleavage  of  caspase-8,  caspase-9,  caspase-3  and 
PARP  (Fig.  2A),  and  increased  annexin  V-positive  cells  (apoptotic  cells)  (Fig.  2B).  These  results 
clearly  indicate  that  downregulation  of  c-FLIP  alone  is  sufficient  to  trigger  apoptosis,  further 
supporting  that  c-FLIP  is  a  promising  lung  cancer  therapeutic  target. 


We  are  currently  identifying  the  optimal  c-FLIP  antibody  for  IHC  so  that  we  can  confirm  our 
findings  of  c-FLIP  overexpression  in  human  NSCLC  specimens  by  IHC. _ 
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Figure  2.  Downregulation  of  c-FLIP  induces  apoptosis 
of  lung  cancer  cells.  A549  cells  were  transfected  with 
control  (Ctrl)  or  c-FLIP  siRNA  for  48  h  and  then  subjected 
to  preparation  of  whole  cell  protein  lysates  for  detection  of 
the  indicated  proteins  by  Western  blotting  (A)  or  to  make 
single  cell  suspensions  for  detection  of  apoptosis  using 
annexin  C  staining  (B).  CF,  cleaved  fragments. 


Aim  2:  To  establish  TRAIL-resistant  cell  lines  from  a  TRAIL-sensitive  lung  cancer  cell 
line  and  determine  whether  levels  of  DcRs,  DRs,  procaspase-8,  TRAIL  and  FLIPs  and  Akt 
activity  are  altered  and  are  associated  with  cell  resistance  to  TRAIL  and  DR-inducing 
agents. 

As  noted  in  the  previous  report,  we  were  not  able  to  demonstrate  that  the  TRAIL-resistant  lung 
cancer  cell  lines  exhibited  cross-resistance  to  some  DR-inducing  agents.  Alternatively,  we  have 
focused  on  addressing  the  question  of  whether  these  agents  modulate  the  DR-mediated 
apoptotic  pathway  and,  if  so,  how  they  modulate  the  DR-mediated  apoptotic  pathway  and 
whether  the  modulations  impact  apoptosis  by  these  DR-inducing  agents. 

Summary  of  Research  Findings 

A.  CHOP-dependent  death  receptor  5  induction  is  a  major  component  of  SHetA2-induced 
apoptosis  in  lung  cancer  cells. 

The  flexible  heteroarotinoids  (Flex-Hets)  represent  a  novel  type  of  atypical  retinoids  lacking 
activity  in  binding  to  and  transactivating  retinoid  receptors.  Preclinical  studies  have 
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demonstrated  that  Flex-Hets  induce  apoptosis  of  cancer  cells  while  sparing  normal  cells,  and 
exhibit  anticancer  activity  in  vivo  with  improved  therapeutic  ratios  over  conventional  retinoid 
receptor  agonists.  Flex-Hets  have  been  shown  to  induce  apoptosis  through  activation  of  the 
intrinsic  apoptotic  pathway.  The  present  study  has  revealed  a  novel  mechanism  underlying 
Flex-Het-induced  apoptosis  involving  induction  of  death  receptor  5  (DR5).  The  representative 
Flex-Het  SHetA2  effectively  inhibited  the  growth  of  human  lung  cancer  cells  in  cell  culture  and  in 
mice.  SHetA2-induced  apoptosis,  which  could  be  abrogated  by  silencing  caspase-8  expression, 
indicate  that  ShetA2  triggers  a  caspase-8-dependent  apoptosis  (Fig.  3).  Accordingly,  SHetA2 
upregulated  DR5  expression  including  cell  surface  levels  of  DR5  and  augmented  tumor  necrosis 
factor-related  apoptosis-inducing  ligand  (TRAIL)-induced  apoptosis.  Importantly,  small 
interference  RNA  (siRNA)-mediated  blockade  of  DR5  induction  conferred  cell  resistance  to 
SHetA2-induced  apoptosis  as  well  as  SHetA2/TRAIL-induced  apoptosis  (Fig.  4).  These  results 
demonstrate  that  DR5  induction  is  a  key  component  of  apoptosis  induced  by  SHetA2  or  by 
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Figure  3.  Effects  of  SHetA2  on  cleavage  of  caspases  and  their  substrates  (A)  and  impact  of  caspase-8 
silencing  on  SHetA2-induced  caspase  cleavage  (8)  and  decrease  in  cell  survival  (C).  A,  The  indicated  cell  lines 
were  exposed  to  the  given  concentrations  of  SHetA2  for  30  h.  The  cells  were  then  harvested  for  preparation  of 
whole-cell  protein  lysates  and  subsequent  Western  blot  analysis  for  detecting  cleavage  of  caspases  and  their 
substrates.  B  and  C,  A549  cells  were  cultured  in  a  6-well  plate  and  the  next  day  transfected  with  control  (Ctrl)  or 
caspase-8  siRNA.  Twenty-four  hours  after  transfection,  the  cells  were  re-seeded  in  a  6-well  plate  or  in  a  96-well 
plate.  On  the  second  day,  the  cells  were  treated  with  DMSO  or  10  jliM  SHetA2  (in  the  6-well  plate)  (B)  or  with  the 
indicated  concentrations  of  SHetA2  (in  the  96-well  plate)  (C).  After  48  h,  the  cells  were  subjected  to  preparation  of 
whole-cell  protein  lysates  and  subsequent  Western  blot  analysis  (B)  or  to  an  estimation  of  cell  number  using  the  SRB 
assay  (C).  Data  in  C  are  mean  of  four  replicate  determinations.  Bars,  ±  SDs.  Pro-casp,  Pro-caspase;  CF,  cleaved 
fragment. 


SHetA2  combined  with  TRAIL.  SHetA2  exerted  CHOP-dependent  transactivation  of  the  DR5 
promoter.  Consistently,  SHetA2  induced  CHOP  expression,  which  paralleled  DR5  upregulation, 
whereas  siRNA-mediated  blockage  of  CHOP  induction  prevented  DR5  upregulation,  indicating 
CHOP-dependent  DR5  upregulation  by  SHetA2.  Collectively,  we  conclude  that  CHOP- 
dependent  DR5  upregulation  is  a  key  event  mediating  SHetA2-induced  apoptosis.  These  data 
have  been  published  in  Cancer  Res  (Lin  et  al.,  Cancer  Res  2008). 
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B.  Involvement  of  c-FLIP  downregulation  in  Flex-Het-induced  apoptosis  and 
enhancement  of  TRAIL-initiated  apoptosis  in  lung  cancer  cells. 

The  above  results  demonstrated  that  upregulation  of  DR5  plays  a  critical  role  in  the  mechanism 
of  SHetA2-induced  apoptosis  in  human  lung  cancer  cells.  The  hypothesis  of  this  study  was  that 
the  mechanism  of  SHetA2-induced  apoptosis  requires  modulation  of  additional  proteins  critical 
for  regulation  of  apoptosis,  including  c-FLIP,  survivin,  XIAP,  Bcl-2,  Bcl-XL,  Bax  and  Bim. 
Western  blot  analysis  demonstrated  that  c-FLIP  and  survivin  were  substantially  reduced  in  all  of 
the  tested  cell  lines  exposed  to  SHetA2  compared  to  other  proteins  that  were  reduced  only  in  a 
subset  of  the  cell  lines  tested  (Fig.  5).  Strikingly,  overexpression  of  c-FLIP,  but  not  survivin, 
protected  cells  from  SHetA2-induced  apoptosis  (Fig.  6)  and  enhancement  of  TRAIL-initiated 
apoptosis  although  knockdown  of  endogenous  survivin  did  slightly  sensitize  cells  to  SHetA2- 
induced  apoptosis.  Consistent  with  these  results,  small  interfering  siRNA-mediated  reduction  of 
c-FLIP  was  more  effective  than  survivin  downregulation  in  triggering  apoptosis  in  these  cell 
lines.  SHetA2  increased  ubiquitination  of  c-FLIP  and  the  consequent  degradation  was 
abrogated  by  the  proteasome  inhibitor  MG132.  Although  SHetA2  treatment  led  to  increased  c- 
Jun  phosphorylation,  the  JNK  inhibitor  SP600125  did  not  prevent  c-FLIP  downregulation  by 
SHetA2.  Thus,  it  appears  that  SHetA2  downregulates  c-FLIP  levels  by  facilitating  its 
ubiquitin/proteasome-mediated  degradation  independent  of  JNK  activation.  Collectively,  the 
present  study  indicates  that  c-FLIP  downregulation  is  another  important  component  of  Flex-  Het 
(SHetA2)-induced  apoptosis  as  well  as  enhancement  of  TRAIL-induced  apoptosis.  This  part  of 
the  results  has  been  published  in  Mol  Cancer  Ther  (Lin  et  al.,  Mol  Cancer  Ther  2008). 


Figure  4.  Blockage  of  DR5  induction  (4) 
attenuates  SHetA2’s  ability  to  activate 
caspases  (A),  decrease  cell  survival  (B), 
induce  apoptosis  (C)  and  augment  TRAIL- 
induced  apoptosis  (D)  in  human  NSCLC  cells. 
A  and  B,  The  indicated  cell  lines  were  cultured  in 
6-well  plates  and  the  next  day  transfected  with 
control  (Ctrl)  or  DR5  siRNA.  Twenty-four  hours 
after  the  transfection,  cells  were  reseeded  in  6- 
well  plates  (A)  or  96-well  plates  (B)  and  treated 
with  10  pM  SHetA2  (A)  or  the  indicated 
concentrations  of  SHetA2  (B).  After  48  h  (30  h 
for  A549  in  A),  the  cells  were  subjected  to 
preparation  of  whole-cell  lysates  and  Western 
blot  analysis  (A)  or  to  the  SRB  assay  for 
calculation  of  cell  survival  (B).  Data  in  B  are 
mean  of  four  replicate  determinations.  Bars,  ± 
SDs.  Pro-casp,  Pro-caspase;  CF,  cleaved 
fragment.  C  and  D,  HI 57  (C)  or  A549  (D)  cells 
were  cultured  in  6-well  plates  and  the  next  day 
transfected  with  control  (Ctrl)  or  DR5  siRNA. 
Forty-eight  hours  after  the  transfection,  the  cells 
were  exposed  to  DMSO  or  10  pM  SHetA2  for  48 
h  (C)  or  treated  with  DMSO  control,  10  pM 
SHetA2  alone,  20  ng/ml  TRAIL  alone  or  SHetA2 
combined  with  TRAIL  for  24  h  (D).  The  cells 
were  then  harvested  for  Annexin  V  assay  to 
detect  apoptosis.  The  percent  positive  cells  in 
the  upper  right  and  lower  right  quadrants  were 
added  to  yield  the  total  of  apoptotic  cells. 
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Figure  5.  Modulatory  effects  of  SHetA2 
on  c-FLIP,  survivin,  XIAP,  Bcl-2,  BcI-XLj 
Bim  and  Bax  in  human  NSCLC  cell  lines. 

The  indicated  cell  lines  were  treated  with  the 
indicated  concentrations  of  SHetA2  for  16  h 
(A)  or  with  5  jliM  SHetA2  for  the  given  time 
as  indicated  (B).  The  cells  were  then 
subjected  to  preparation  of  whole-cell 
protein  lysates  and  subsequent  Western  blot 
analysis. 


Figure  6.  Enforced  expression  of  ectopic 
c-FLIP  protects  cells  from  SHetA2- 
induced  cell  number  decrease  (A  and  B) 
and  apoptosis  (C  and  D).  A  and  B,  The 

indicated  transfectants  were  seeded  in  96- 
well  plates  and  treated  with  the  indicated 
concentrations  of  SHetA2  ranging  from  2.5 
to  10  jlxM .  After  48  h,  the  cells  were 
subjected  to  the  SRB  assay  for 
measurement  of  cell  number.  Data  are  the 
means  of  four  replicate  determinations. 
Bars;  ±  SDs.  C  and  D,  The  indicated 
transfectants  were  treated  with  10  jliM 
SHetA2  for  48  h  and  then  harvested  for 
detection  of  apoptotic  cells  using  Annexin  V 
staining.  The  percent  positive  cells  in  the 
upper  right  and  lower  right  quadrants  were 
added  to  yield  the  total  of  apoptotic  cells. 


C.  Coupling  of  endoplasmic  reticulum  (ER)  stress  to  CDDO-Me-induced  up-regulation  of 
death  receptor  5  via  a  CHOP-dependent  mechanism  involving  JNK  activation. 

The  synthetic  triterpenoid  methyl  2-cyano-3,12-dioxoolean-1,9-dien-28-oate  (CDDO-Me)  is  in 
Phase  I  clinical  trials  as  a  novel  cancer  therapeutic  agent.  We  previously  demonstrated  that 
CDDO-Me  induces  c-Jun  N-terminal  kinase  (JNK)-dependent  death  receptor  5  (DR5) 
expression  and  augments  death  receptor-induced  apoptosis.  The  current  study  focused  on 
addressing  how  CDDO-Me  induces  JNK-dependent  DR5  expression.  Analysis  of  DR5  promoter 
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Figure  7.  CDDO-Me  induces  CHOP- 
dependent  DR5  expression.  A)  The 

CDDO-Me  induces  dose-dependent 
DR5  expression.  The  indicated  cells 
lines  were  treated  with  the  indicated 
concentrations  of  CDDO-Me  for  12  h. 
B)  CDDO-Me  induces  time-dependent 
DR5  expression.  H460  and  HI 792  cells 
were  treated  with  1  jliM  CDDO-Me  for 
the  times  as  indicated.  C)  Blockade  of 
CHOP  induction  abrogates  CDDO-Me- 
induced  DR5  upregulation.  Both  HI 792 
and  H460  cells  lines  were  transfected 
with  control  (Ctrl)  or  CHOP  siRNA.  After 
48  h,  the  cells  were  treated  with  1  pM 
CDDO-Me  for  12  h  (H1792)  or  with  0.5 
pM  CDDO-Me  for  8  h  (H460).  After  the 
aforementioned  treatments  (A-C),  the 
cells  were  subjected  to  preparation  of 
whole-cell  protein  lysates  and 
subsequent  Western  blot  analysis.  D) 
Blockade  of  CHOP  induction  inhibits 
CDDO-Me-induced  increase  in  cell 
surface  DR5.  HI  792  cells  were 
transfected  with  control  (Ctrl)  or  CHOP 
siRNA.  After  48  h,  the  cells  were  treated 
with  1  pM  CDDO-Me  for  12  h  and  then 
subjected  to  staining  of  cell  surface 
DR5  and  subsequent  flow  cytometry. 
Ab,  antibody. 


C 


Figure  8.  Blockade  of  CHOP  induction 
diminishes  CDDO-Me’s  ability  to 
induce  apoptosis.  A)  HI 792  cells  were 
plated  in  6-well  cell  culture  plates  and 
transfected  with  control  (Ctrl)  or  CHOP 
siRNA.  After  48  h,  the  cells  were  treated 
with  the  1  pM  CDDO-Me  for  12  h  and 
then  subjected  to  flow  cytometric 
analysis  for  sub-GI  populations. 
Columns,  means  of  duplicate  assays; 
Bars,  ±  SE.  B  and  C)  H460  cells  were 
plated  in  6-well  cell  culture  plates  and 
transfected  with  control  (Ctrl)  or  CHOP 
siRNA.  After  48  h,  the  cells  were  treated 
with  the  0.5  pM  CDDO-Me  for  8  h  and 
then  subjected  to  preparation  of  whole¬ 
cell  protein  lysates  and  Western  blot 
analysis  (B).  In  addition,  the  cells  were 
also  harvested  for  annexin  V  staining 
and  flow  cytometric  analysis  of  apoptotic 
cells  (C).  The  percent  positive  cells  in 
the  upper  right  and  lower  right  quadrants 
were  added  to  yield  the  total  of  apoptotic 
cells. 
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regions  defines  that  the  CHOP  binding  site  is  responsible  for  CDDO-Me-induced  transactivation  of 
the  DR5  gene.  Consistently,  CDDO-Me  induced  DR5  expression  and  parallel  CHOP  upregulation. 
Blockade  of  CHOP  upregulation  also  abrogated  CDDO-Me-induced  DR5  expression  (Fig.  7).  These 
results  indicate  that  CDDO-Me  induces  CHOP-dependent  DR5  upregulation.  Moreover,  the  JNK 
inhibitor  SP600125  abrogated  CHOP  induction  by  CDDO-Me,  suggesting  a  JNK-dependent  CHOP 
upregulation  by  CDDO-Me  as  well.  Importantly,  knockdown  of  CHOP  attenuated  CDDO-Me- 
induced  apoptosis,  demonstrating  that  CHOP  induction  is  involved  in  CDDO-Me-induced  apoptosis 
(Fig.  8).  Additionally,  CDDO-Me  increased  the  levels  of  Bip,  phosphorylated  elF2a,  IREIa,  and 
ATF4,  all  of  which  are  featured  changes  during  endoplasmic  reticulum  (ER)  stress.  Furthermore, 
salubrinal,  an  inhibitor  of  ER  stress-induced  apoptosis,  inhibited  JNK  activation  and  upregulation  of 
CHOP  and  DR5  by  CDDO-Me  and  protected  cells  from  CDDO-Me-induced  apoptosis  (Fig.  9). 
Thus,  ER  stress  appears  to  be  important  for  CDDO-Me-induced  JNK  activation,  CHOP  and  DR5 
upregulation,  and  apoptosis.  Collectively,  we  conclude  that  CDDO-Me  triggers  ER  stress,  leading  to 
JNK-dependent,  CHOP-mediated  DR5  upregulation  and  apoptosis.  These  data  have  been 
published  in  Cancer  Res  (Zou  et  al.,  Cancer  Res  2008). 
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Figure  9.  CDDO-Me  increases  the  levels 
of  ER  stress  marker  proteins  (A)  and 
induces  ER  stress-mediated  apoptosis 
(B-D).  A)  The  indicated  cell  lines  were 
treated  with  1  jliM  CDDO-Me.  After  the 
indicated  times,  the  cells  were  harvested 
and  subjected  to  preparation  of  whole-cell 
protein  lysates  for  detection  of  the 
indicated  proteins  by  Western  blot 
analysis.  B)  The  indicated  cell  lines  were 
seeded  in  96-well  cell  culture  plates  and 
treated  the  next  day  with  DMSO,  1  juM 
CDDO-Me  alone,  the  given  concentrations 
of  salubrinal  (Sal)  alone,  and  the 
respective  combination  of  CDDO-Me  with 
each  of  the  given  concentrations  of 
salubrinal.  After  24  h,  the  cells  were 
subjected  to  the  SRB  assay  for  estimation 
of  cell  numbers.  Columns,  means  of  four 
replicate  determinations;  Bars,  ±  SDs;  *  P 
<  0.001  compared  with  CDDO-Me  alone 
by  one-way  ANOVA  analysis  using 
GraphPad  InStat  3  software  (GraphPad 
Software,  San  Diego,  CA).  C  and  D)  H460 
cells  were  seeded  in  6-well  plates  and 
treated  the  next  day  with  DMSO,  1  juM 
CDDO-Me  alone,  75  jliM  salubrinal  (Sal) 
alone,  and  their  combination.  After  6  h,  the 
cells  were  harvested  for  preparation  of 
whole-cell  protein  lysates  for  detection  of 
the  indicated  proteins  by  Western  blot 
analysis  (C)  or  for  detecting  annexin  V- 
positive  cells  using  flow  cytometry  (D). 
CFs,  cleaved  forms. 
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Aim  3:  To  determine  whether  suppression  of  PI3K/Akt  activity  sensitizes  premalignant 
and/or  malignant  airway  epithelial  cells  to  apoptosis  induced  by  DR-induced  agents  via 
enhancement  of  TRAIL/DR-mediated  mechanism. 

Summary  of  Research  Findings 

A.  Perifosine  inhibits  Akt/mTOR  signaling  through  facilitating  degradation  of  major 
components  in  mTOR  pathway  and  induces  autophagy. 

The  Akt  inhibitor,  perifosine,  is  an  alkylphospholipid  exhibiting  antitumor  activity  as 
demonstrated  in  both  preclinical  studies  and  clinical  trials.  This  activity  is  partly  associated  with 
its  ability  to  inhibit  Akt  activity.  It  has  been  shown  that  mTOR  axis  plays  a  critical  role  in 
regulation  of  cell  proliferation  and  survival,  primarily  through  functioning  both  downstream  and 
upstream  of  Akt.  The  current  study  reveals  a  novel  mechanism  by  which  perifosine  inhibits  Akt 
and  mTOR  axis.  In  addition  to  inhibition  of  Akt,  perifosine  inhibited  the  assembly  of  both 
mTOR/raptor  and  mTOR/rictor  complexes.  Strikingly,  perifosine  reduced  the  levels  of  Akt  and 
other  major  components  including  mTOR,  raptor,  rictor,  p70S6K,  S6,  4E-BP-1  and  Rhebl  in  the 
mTOR  axis  through  promoting  their  degradation  because  the  proteasome  inhibitor  MG132 
prevented  the  reduction  of  these  proteins  by  perifosine.  These  results  suggest  that  perifosine 
inhibits  the  mTOR  axis  through  degradation  of  major  components  in  this  axis,  showing  a 
different  mechanism  from  inhibition  of  mTOR  signaling  by  classical  mTOR  inhibitors  such  as 
rapamycin.  Moreover,  perifosine  substantially  increased  the  levels  of  type  II  LC3,  a  hallmark  of 
autophagy,  in  addition  to  increasing  PARP  cleavage,  suggesting  that  perifosine  induces  both 
apoptosis  and  autophagy  (Fig.  10).  Collectively,  we  conclude  that  perifosine  inhibits  mTOR 
signaling  and  induces  autophagy,  highlighting  a  novel  mechanism  accounting  for  perifosine’s 
anticancer  activity. 
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|  Figure  10.  Perifosine  increased  the  levels  of 
both  LC3-II  and  cleaved  PARP  (A  and  B),  which 
care  further  increased  when  co-treated  with 
perifosine  and  NH4CI  (C).  A  and  B,  The  indicated 
cell  lines  were  treated  with  the  given  concentrations 
of  perifosine  (PRFS)  for  8  h  (A)  or  with  10  jliM 
(HI 57)  or  5  jliM  (H460)  perifosine  for  the  given  times 
(B).  C,  H460  cells  were  pre-treated  with  20  mM 
NH4CI  for  30  min  and  then  co-treated  with  5  pM 
perifosine  for  additional  4  h.  After  the 

aforementioned  treatment,  the  cells  were  then 
harvested  for  preparation  of  whole  cell  protein 
lysates  and  subsequent  Western  blot  analysis  for 
detection  of  the  indicated  proteins. 
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Figure  11.  Inhibition  of  autophage  with 
lysosomal  inhibitors  enhances 
perifosine’s  apoptosis-inducing  activity. 

A.  The  indicated  cell  lines  were  seeded  on 
96-well  plates  and  on  the  second  day 
treated  with  the  given  concentrations  of 
perifosine  (PRFS)  alone,  CQ  or  NH4CI 
alone,  or  the  combination  of  CQ  with  PRFS 
or  NH4CI  with  PRFS.  After  48  h,  the  cells 
were  subjected  to  the  SRB  assay  for 
estimating  cell  numbers.  B  and  C,  H157 
cells  were  seeded  on  10  cm  diameter  cell 
culture  dishes  and  treated  with  on  the 
second  day  with  PRFS  alone  (5  or  10  uM), 
CQ  or  NH4CI  alone,  or  the  combination  of 
PRFS  with  CQ  or  NH4CI.  After  48h,  the 
cells  were  harvested  for  annexin  V  staining 
of  apoptotic  cells  (6)  or  for  Western  blot 
analysis  of  the  indicated  proteins  (C). 


It  has  been  suggested  that  autophagy  is  a  survival  mechanism  that  protects  cells  from  being 
killed  though  apoptosis.  Thus,  we  further  investigated  the  impact  of  autophagy  on  perifosine’s 
apoptosis-inducing  activity.  We  found  that  inhibition  of  autophagy  with  various  lysosomal 
inhibitors  enhanced  perifosine-induced  apoptosis  (Fig.  11),  suggesting  that  autophagy  induced 
by  perifosine  is  a  survival  mechanism,  and  that  its  inhibition  will  enhance  perifosine’s  apoptosis- 
inducing  activity.  We  are  currently  studying  whether  inhibition  of  autophagy  enhances 
perifosine’s  anticancer  activity  in  animal  xenograft  models. 

Aim  4:  To  determine  whether  DRs,  DcRs,  c-FLIP,  and  procaspase-8  serve  as  biomarkers 
for  lung  cancer  chemoprevention  and  therapy. 

Summary  of  Research  Findings 

This  work  will  be  initiated  upon  receipt  of  sufficient  tissue  slides  from  the  clinical  trial. 

Key  Research  Accomplishments 

1.  c-FLIP  is  overexpressed  primarily  in  transformed  malignant  lung  epithelial  cell  lines  and 
may  serve  as  a  promising  therapeutic  target  for  prevention  and  therapy  of  lung  cancer. 

2.  The  synthetic  atypical  retinoid  SHetA2  induces  apoptosis  and  enhances  TRAIL-induced 
apoptosis  involving  up-regulation  of  DR5  and  down-regulation  of  c-FLIP. 

3.  CDDO-Me-induced  up-regulation  of  death  receptor  5  is  coupled  with  endoplasmic 
reticulum  (ER)  stress  via  a  CHOP-dependent  mechanism  involving  JNK  activation. 

4.  Perifosine  inhibits  Akt/mTOR  signaling  and  subsequently  induces  autophagy.  Inhibition 
of  this  autophagy  augments  perifosine’s  apoptosis-inducing  activity,  thus  suggesting  a 
therapeutic  strategy  of  combining  perifosine  with  an  autophagy  inhibitor. 

Conclusion 


Appropriate  modulation  of  the  extrinsic  death  receptor-mediated  apoptotic  pathway  such  as 
upregulation  of  DR5  and/or  reduction  of  c-FLIP  levels  by  small  molecules  may  eliminate 
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premalignant  or  malignant  lung  epithelial  cells  via  promoting  apoptotic  cell  death  to  achieve 
cancer  chemopreventive  and  therapeutic  goals. 

The  potential  of  the  modulation  of  DR5  and  c-FLIP  as  predictive  biomarkers  for  certain  drugs  in 
the  clinic  warrants  further  investigation.  Inhibition  of  autophagy  may  be  a  good  strategy  to 
augment  certain  anticancer  agents’  anticancer  activity. 


Project  5:  Molecular  Strategies  Targeting  the  AKT  Signaling  Pathway  for  Lung  Cancer 
Chemoprevention  and  Therapy 

(PI  and  co-PI:  Ho-Young  Lee,  Ph.D.,  Edward  S.  Kim,  M.D.) 

Our  goal  is  to  find  novel  chemopreventive/therapeutic  agents  that  can  prevent  lung 
carcinogenesis  effectively.  Results  from  our  work  and  others’  have  demonstrated  that  Akt,  which 
has  a  clear  role  in  cellular  survival  and  transformation,  is  constitutively  active  in  premalignant 
and  malignant  HBECs  and  in  NSCLC  cell  lines.  These  findings  suggest  an  importance  of 
PI3K/Akt  signaling  pathway  in  lung  carcinogenesis.  The  purpose  of  our  studies  is  to  determine 
whether  activation  of  Akt  induces  malignant  transformation  of  HBE  cells  and  to  develop  novel 
agents  inhibiting  Akt  activity  as  a  strategy  to  prevent  lung  carcinogenesis. 

Aim  1  Develop  a  retroviral  vector  expressing  constitutively  active  Akt  and  characterize 
the  in  vitro  and  in  vivo  effects  of  Akt  activation  on  the  malignant  transformation  of 
HBE  cells. 

Summary  of  Research  Findings 

As  we  reported  last  year,  we  finished  constructing  retroviral  vectors  expressing  constitutively 
active  or  dominant  negative  Aktl ,  2,  or  3.  The  Akt  constructs  were  confirmed  by  sequencing  and 
Western  blot  analysis.  Viral  titers  have  been  determined  by  the  colony  formation  analysis.  To 
analyze  roles  of  Akt  1,  2  or  3  in  the  survival  of  HBE  cells,  BEAS2B  cells  were  stably  transfected 
with  retroviral  vectors  expressing  constitutively  active  Akt  1,2,3.  We  have  been  analyzing  the 
survival  of  the  cells  in  the  presence  of  H202,  and  found  that  Aktl,  2,  and  3  protected  HBE  cells 
from  cell  death  (data  not  shown). 

Aim  2  Evaluate  the  ability  of  chemopreventive  agents  used  in  VITAL  trials  (gefitinib, 
erlotinib,  SCH66336,  and  celecoxib,  alone  and  in  combination)  to  inhibit  Akt 
activity  and  induce  apoptosis  in  transformed  HBE  and  NSCLC  cell  lines. 

Summary  of  Key  Research  Findings 

Due  to  increased  expression  of  IGFs  detected  in  human  preneoplastic  bronchial  tissues,  in 
which  mutations  of  p53  or  KRAS  frequently  occur  (1),  we  assessed  whether  genetic  alterations 
of  p53  or  KRAS  (VI 2)  were  associated  with  increased  expression  of  IGFs  and  IGF-1R  activation. 
We  analyzed  immortalized  HBE  cells  (HBEC3)  and  derivatives,  p53  short-interfering  RNA- 
(siRBA)-expressing  cells  (HBEC3/p53i),  or  RASv12-expressing  (HBEC3/RASv12)  HBEC3  cells, 
which  harbor  several  tumorigenic  characteristics  but  are  not  yet  fully  malignant.  Thus,  these 
cells  mimic  the  premalignant  stage  human  lung  carcinogenesis  (2).  Notably,  both 
HBEC3/RASv12  and  HBEC3/p53i  cells  showed  markedly  increased  IGF-1  and  IGF-2  mRNA 
expression  (Figure  1A)  and  IGF-1  R  phosphorylation  at  Tyrl  131  (Fig.  IB)  and  Tyrl  135/1 136. 
These  cells  also  showed  markedly  increased  Akt  phosphorylation  (Ser  473)  compared  to  control 
HBEC3  cells. 
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Despite  the  fact  that  TCs  are 
genotoxic  and  may  induce  cell  death, 
HBE  cells  have  shown  increased 
viability  when  exposed  to  NNK  (3). 
Thus,  we  sought  to  determine 
whether  activated  IGF-1R  exerted 
protective  effects  on  TC-exposed 
lung  epithelial  cells  by  analyzing  the 
viability  of  NNK-exposed  HBEC3  cell 
derivatives.  HBEC3  cell  derivatives 
showed  markedly  increased  viability 
when  exposed  to  NNK  in  the 
absence  of  EGF  (Fig.  2A),  with  the 
greatest  increase  seen  in  p53  RNAi- 
and  RASv12-expressing  HBEC3 
cells.  Moreover,  p53  RNAi-  and 
RASv12-expressing  HBEC3  cells 
showed  the  greatest  increase  in  the 
numbers  of  foci  formed  upon 
reaching  a  confluent  state  (Fig.  2B) 
and  anchorage-independent  colony 
forming  (Fig.  2C).  The  NNK-induced  increases  in  viability,  foci  forming  ability  and  anchorage- 
independent  colony  forming  activity  induced  by  exposure  in  p53  RNAi-  and  RASv12-expressing 
HBEC3  cells  were  almost  completely  suppressed  by  PQIP  (an  IGF-1R  TKI).  These  findings 
indicated  the  importance  of  autocrine  IGFs  in  TC-induced  HBE  cell  transformation. 


HBEC/ 

Ras 

Figure  1A.  Expression  of  IGF-1  and  IGF-2 
from  HBEC3,  HBEC3/RASvl2  and  HBEC3/p5 
3i  cells  cultured  without  supplement  using 
quantitative  real-time  polymerase  chain 
reaction 


Figure  IB.  Enhanced  IGF  signaling 
in  the  HBEC3,  HBEC3/RASvl2,  HBE 
C3/p53i  and  HBEC3/  RASvl2p53i  cell 
s  is  shown  with  Western  blotting  of 
the  markers  of  the  signaling. 


Dependence  of  TC-exposed  HBE  cells  IGF-1  R  signaling  for  maintaining  transformed 
phenotypes 

The  findings  above  showed  an  important  role  for  IGF-1  R-mediated  signaling  in  the 
transformation  of  HBE  cells,  especially  that  stimulated  by  TCs.  We  therefore  questioned 
whether  TC-exposed  premalignant  and  malignant  HBE  cells  are  dependent  on  IGF-1  R  signaling 
for  sustained  proliferation  and  survival.  We  evaluated  the  effects  of  the  inhibition  of  IGF-1  R 
signaling  on  an  in  vitro  model  of  progressive  lung  carcinogenesis.  The  model  was  composed  of 
the  following  cell  lines:  1799  (a  derivative  of  SV40  large  tumor  antigen-immortalized  bronchial 
epithelial  BEAS2B  exposed  to  beeswax),  used  as  a  control;  non-tumorigenic  1198  (a  derivative 
of  BEAS2B  exposed  to  cigarette-smoke  condensate);  and  tumorigenic  1170-1  (a  derivative  of 
BEAS2B  exposed  to  cigarette-smoke  condensate)  (4,  5).  The  1198  and  1170-1  cells  exhibited 
greater  levels  of  IGF-1  R  and  IRS-1  phosphorylation  (Fig.  3A)  than  1799  cells.  The  1198  and 
1170-1  cells  infected  with  adenovirus-expressing  dominant-negative  IGF-1  R  (Ad-dnIGF-IR) 
showed  a  dose-dependent  decrease  in  cell  viability  (Fig.  3B)  and  anchorage-dependent  colony 
formation  (Fig.  3C)  compared  to  those  infected  with  a  control  virus  (Ad-EV).  In  contrast,  1799 
cells  were  not  affected  by  the  Ad-dnIGF-IR  treatment.  1198  and  1170-1  cells  (Fig.  3D)  showed 
significantly  decreased  viability  in  response  to  the  IGF-1  R  TKI  (PQIP),  but  not  to  EGFR  TKI 
(erlotinib)  when  treated  in  the  absence  of  a  growth  factor.  Neither  PQIP  nor  erlotinib  affected  the 
viability  of  1799  cells.  These  findings  indicated  that  IGF  -1R  activation  is  required  for  the 
transformation  of  TC-exposed  HBE  cells. 


47 


Army  Award  W81XWH-04-1-0142;  Waun  Ki  Hong,  M.D. 

Annual  Report:  Reporting  Period  15  December  2007  -  14  December  2008 


Fig2A 


m  Con  □  NNK 

CD  PQIP  rzi  PQIP+NNK 


Fig2B 


Control 

PQIP 


NNK 

PQIP+NNK 


511 
“  40 
1.30 


o  20 


10 


***  *** 

T 

HBEC3  HBEC3rp53i-  RASV12 
_ -  PQIP 


Control 


NNK  | 


HBEC3Jp53i-  RAS 


Fig  2C 


H  Control  E3  NNK 
a  PQIP  M  PQIP+NNK 


350 

*** 

^-i 

300 

«  250 

Q] 

E  200 

g  150 

'S  100 

_o  511 

0 

HBEC3  HBEC3Jji53i-  RASV 

Control 


NNK 


HBEC3lji53i-  RAS'11 


Figure  2.  Insulin-ll=:e  graiut^ifectors  (IGFsj  arid  tobacco  canchogens  synergEing  to  stimulate  the  transformation  of  hurian  bronchial 
epithelium  (FIBE)  cells.  HBEC3  cells  and  a  variant  expressing  oncogenic  KRASvl  2  (Ras)  and  knockdown  of  p53  by  RNAi  (p53i)  \m 
ne  untreated  or  treated  with  l\|\IK  (1  pM,  5  |JV1)  as  indicated  and  then  continuously  treated  with  PQIP  (1  |Jvlj  or  no  tneahient  for  3  d 
ays  in  the  absence  or  presence  of  hNK.  MTT  (A,  B)  andfbci-foiming  assays  (C)  were  performed.  A  representative  picture  of  foci  for¬ 
mation  of  HBEC3  cells  epnesshg  K-RASv12  and  p53  RNAi  is  shown.  Data  are  means  ±  standard  error  of  the  mean.  Statistical  sig 
niticance  of  differences  was  determined  using  a  two-tailed  Student's  t test.  *  (P  <0.05];  **P  <  0.01  ;*“P  <  0.001. 
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Figure  3.  Effects  of  insulin-like  growth  factor-1  receptor  (IGF-1R)  signaling  blocking  on  the  survival  of  tobacco  carcinogens  (TC)-induced  in  vitro  lung 
carcinogenesis  model  cell  lines.  A)  Western  blotting  of  the  IGFR  signaling  molecules  in  1788, 1198,  and  1170-1  cells.  Compared  to  immortalized  1799 
cells,  the  1198  and  1170-1  cells  showed  highly  activated  p-IGFR  and  p-IRS.  B,  C)  Effects  of  IGF-1R  blocking  with  dnIGF-lR  adenovirus  on  the  survival 
(B)  and  colony  formation  (C)  of  1799, 1198,  and  1170-1  cells.  The  cells  were  uninfected  or  infected  with  the  indicated  dose  of  empty  adenovirus  or 
adenovirus  expressing  dominant  negative  IGF-1R  (dnIGF-lR).  After  3  days  of  incubation  in  the  absence  of  a  growth  factor,  cells  were  subjected  to  the 
MTT  assay  (B)  and  anchorage-dependent  colony  formation  analysis  (C).  The  two  transformed  cells  depend  on  the  IGFR  signaling  for  the  survival  and 
colony  formation;  the  normal-like,  immortalized  cells  1799  do  not  require  IGFR  signaling.  D)  Effects  of  IGF-1R  blocking  with  PQIP  on  the  survival  of  1799 
and  1170-1  cells.  The  cells  were  treated  with  erlotinib  or  PQIP  at  the  indicated  concentration  and  subjected  to  an  MTT  assay  after  3  days.  Data  are  means 

±  standard  deviation  of  at  least  three  independent  experiments.  Statistical  significance  of  differences  was  determined  using  a  two-tailed  Student’s  t  test. 
Differences  between  groups  were  considered  significant  when  *  P  <  0.05;  **P  <  0.01;  and  ***  P  <  0.001. 
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Aim  3  Determine  whether  Akt  is  activated  in  bronchial  specimens  from  enrolled  patients 
in  VITAL  trials  and  whether  treatment  with  chemopreventive  agents  suppresses 
Akt  level  or  activity  in  these  patients. 

Summary  of  Research  Findings 

Our  study  assessed  the  role  of  IGF-1R  signaling  in  lung  carcinogenesis.  We  measured  the 
expression  levels  of  IGF-1,  IGF-2,  IGF-1R  and  phospho-IGF-IR  in  tissue  microarrays 
comprising  367  biopsy  specimens  of  normal,  hyperplasic,  squamous  metaplastic  (SQM),  low- 
grade  dysplacsia  (L-DYS),  and  high-grade  dysplasia  (H-DYS)  bronchial  tissue  specimens 
(Supplementary  Table  1  shows  patient  demographics).  IGF-1,  IGF-2,  and  IGF-1  R  expressions 
were  primarily  cytoplasmic  and  less  frequently  nuclear  (Fig.  1A).  We  observed  significantly 
higher  expression  of  IGF-1  (P  <  0.0001)  and  IGF-2  (P  =  0.004)  in  H-DYS  than  in  normal 
bronchial  specimens  (Figs.  4A  and  4B).  IGF-1  R  is  primarily  activated  by  its  cognate  ligands 
(IGF-1  and  IGF-2)  with  2  to  15  times  less  affinity  for  IGF-2  than  for  IGF-1.  IGFs  also  bind  to  the 
insulin  receptor  (IR)  with  roughly  100  times  less  affinity  than  insulin  (6).  We  assessed  whether 
increased  levels  of  the  IGFs  were  associated  with  activation  of  IGF-1  R  signaling  by  performing 
immunohistochemical  analysis  using  an  antibody  that  detects  phosphorylated  IGF-1  R  (pIGF- 
1R/IR)  (Tyr  1162/1163);  the  staining  appeared  in  the  cell  membranes,  cytoplasm,  and  nuclei. 
Staining  in  the  membrane  was  significantly  higher  in  H-DYS  than  in  normal,  hyperplastic,  SQM, 
and  L-DYS  bronchial  specimens  (Fig.  4B)  and  correlated  well  with  the  levels  of  IGF-1  and  IGF- 
2  (Fig.  4C). 

We  assessed  the  role  of  survivin,  one  of  the  inhibitors  of  apoptosis  protein  regulated  by  IGF- 
IR/Akt  signaling  pathway,  in  lung  carcinogenesis.  We  investigated  the  effects  of  the  tobacco 
component  nicotine  and  its  related  carcinogen  4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone 
(NNK)  on  survivin  expression  in  normal  human  bronchial  epithelial  (HBE)  cells  and  examined 
the  role  of  survivin  in  the  malignant  transformation  of  HBE  cells  induced  by  these  components. 
We  found  that  survivin  mRNA  expression  was  detected  in  41%  (7  of  17)  of  bronchial  brush 
specimens  from  heavy  smokers  (at  least  20  pack/year).  Nicotine  and  NNK  increased  survivin 
mRNA  and  protein  expression  levels  in  primary  cultured  normal  HBE  cells  and  immortalized 
HBE  cells.  Nicotine  and  NNK  stimulated  the  Akt/mammalian  target  of  rapamycin  (mTOR) 
pathway  in  normal  HBE  cells,  leading  to  increased  de  novo  synthesis  of  survivin  protein. 
Induced  survivin  expression  increased  the  survival  potential  of  the  cells,  which  was  blocked  by 
transfection  with  survivin-specific  small  interfering  RNA.  Small  interfering  RNA-induced  down- 
regulation  of  survivin  expression  also  suppressed  the  tumorigenic  potential  of  premalignant  and 
malignant  HBE  cells  exposed  to  the  tobacco  components.  These  findings  suggest  that  NNK  and 
nicotine  induce  survivin  protein  synthesis  in  normal  HBE  cells  by  activating  the  Akt/mTOR 
pathway  and,  thus,  blockade  of  the  pathway  effectively  inhibits  the  tobacco-induced  malignant 
transformation  of  HBE  cells.  These  results  are  published  in  Jin  et  al.,  Carcinogenesis 
29(8):1514-1622  (2008). 
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Figure  4.  Inririunchistjcchemical  (IHC)  scoring  of  insulin-like  growth  fetor  [IGF]  -1  and  IGF-2  in  notnial  and  preneoplastic  lesions  in  patients  with  non-small 
cell  lung  cancer.  A]  Representative  results  of  IHC  staining  of  IGF  signaling  bicmaikers  in  noimal  and  high-grade  dysplasia  [H-DYS]  specmens.  B]  Oomparis 
on  of  IHC  scoring  of  IGF-1,  IGF-2,  IGF-1  R,  orpIGF-IRhy  histologic  evaluation  of  normal,  hyperplastic  [HYP],  squamous  metaplastic  [SCJUt]  low-grade  dyspl 
asia  [L-DYS]  and  high-grade  dysplasia  [H-DYS],  Box  plots  show  the  medians  and  the  25th  and  75th  percentiles;  the  bars  give  the  ranges.  Statistical  signific 
ance  of  differences  was  determined  with  a  repeated-measures  model,  and  differences  between  gro^s  were  considered  significant  when  *(P  <  0.05),  **  (P  < 
0.011  ***  (P  <  0.0011  statistically  significant  compared  with  normal  tissue;  #  (P  <  0.051  statistically  significant  con^ared  with  HYP;  t  Ip  ::  0.05),  statistically 
significant  compared  with  SQM;  +(P  <  005],  statistically  significant  compared  with  L-DYS.  C)  Linear  regression  analysis  was  performed  to  evaluate  the  sig 
nificance  of  any  association  between  the  scoring  of  IGF-1 R,  IGF-1,  or  IGF-2  and  the  scoring  of  pIGF-IR  in  smokers  and  nonsmokets.  The  associations 
between  p-IGFRand  IGF-R,  IGF-1  and  IGF-2  were  evaluated  by  ushg  SpeaimaiYs  rankconelation  coefficient  [0.08, 0.279,  and  0.228,  respectively]  [The  p 
values  were  0.2373, 0.001,  and  0.0015  respectively).  The  correlations  were  considered  significant  when  P  <0.05. 
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Key  Research  Accomplishments 

•  IGF-1  and  IGF-2  levels  in  bronchial  tissue  specimens  containing  high-grade  dysplasia 
were  significantly  higher  than  in  those  containing  normal  epithelium,  hyperplasia,  and 
squamous  metaplasia. 

•  Derivatives  of  human  bronchial  epithelial  cell  lines  with  activation  mutation  in  KRAS 
(VI 2)  or  loss  of  p53  (genetic  changes  frequently  observed  during  lung  carcinogenesis) 
overexpressed  IGF-1  and  IGF-2.  Tobacco  carcinogen  (TC)  4-(methylnitrosamino)-1-(3- 
pyridyl)-1-butanone  (NNK)  enhanced  transformed  characteristics  of  these  cells,  which 
were  significantly  suppressed  by  inactivating  IGFR. 

•  NNK  and  nicotine  induced  survivin  protein  synthesis  in  normal  HBE  cells  by  activating 
the  Akt/mTOR  pathway. 

Conclusion 

Our  results  demonstrate  that  airway  epithelial  cells  produce  IGFs  in  an  autocrine  manner,  and 
these  IGFs  act  jointly  with  TCs  to  stimulate  lung  carcinogenesis.  Thus,  the  use  of  selective  IGF- 
1R  inhibitors  may  be  a  rational  approach  to  controlling  lung  cancer.  Blockade  of  the  pathway 
effectively  inhibits  the  tobacco-induced  malignant  transformation  of  HBE  cells. 


Core  B:  Biostatistics  &  Data  Management  Core 

(Core  Director:  J.  Jack  Lee,  Ph.D.) 

Core  Goals: 

1.  To  provide  statistical  design,  sample  size/power  calculations,  and  integrated, 
comprehensive  analysis  for  each  basic  science,  pre-clinical,  and  clinical  study. 

2.  To  develop  a  data  management  system  that  provides  tracking,  quality  control,  and 
integration  of  clinical,  pathological,  and  basic  science  data.  New  database  modules  will 
be  developed  and  integrated  to  the  existing  VITAL  web-based  database  and  with  the 
clinical  database  from  the  Department  of  Thoracic/Head  and  Neck  Medical  Oncology. 

3.  To  provide  statistical  and  data  management  support  for  genomic  and  imaging  studies 
including  microarray,  proteomics,  protein  antibody  array,  and  spiral  CT. 

4.  To  develop  and  adapt  innovative  statistical  methods  pertinent  to  biomarker-integrated 
translational  lung  cancer  studies. 

5.  To  generate  statistical  reports  for  all  projects. 

6.  To  collaborate  and  assist  all  project  investigators  in  the  publication  of  scientific  results. 

Summary  of  Research  Findings 

The  Biostatistics  and  Data  Management  Core  has  continued  to  work  actively  with  all  the  VITAL 
Projects  in  their  research  efforts  in  the  areas  of  biostatistical  support  and  consulting  in  the 
clinical  trial  design,  implementation,  conduct,  and  analysis  of  experimental  results. 

Key  Research  Accomplishments 

Our  major  effort  in  the  fifth  year  was  to  continue  to  provide  statistical  and  data  management 
support  for  the  “ReVITALization”  of  Project  1.  A  major  database  revamping  effort  has  been 
accomplished  to  integrate  the  retrospectively  collected  data  with  the  prospectively  conducted 
clinical  trial  data.  We  have  developed  and  continue  to  provide  enhancement  of  a  web-enabled 
database  system  to  facilitate  the  research  activities  of  the  VANGUARD  Trial.  New  database 
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modules  have  been  developed  and  integrated  into  the  existing  VITAL  database  and  with  the 
clinical  database  from  the  Department  of  Thoracic/Head  and  Neck  Medical  Oncology.  The  key 
activities  of  the  database  effort  are  summarized  below. 

•  ReVITALization  database  has  been  developed  to  extend  the  VITAL  database  providing 
additional  tissue  clinical  and  pathological  data  repositories  and  tissue  tracking. 

•  The  SQL  Server  2005  database  and  ASP.NET  web  application  were  used  with  VB.net 
language  to  provide  the  database  infrastructure.  Queries  and  SQL  2005  reports  were 
implemented.  Secure  Socket  Layer  (SSL)  and  secured  database  password  were  applied 
to  keep  data  transaction  protected  and  confidential.  Key  components  include: 

1)  Clinical  Module:  The  database’s  clinical  module  provides  user  friendly  input 
interfaces  for  entering  and  viewing  patients’  clinical  data  and  facilitates  the  patient 
search  by  medical  record  number  (MRN),  path  number  and  name.  The  module 
contains  the  following  web  forms: 

Patient  Information 

-  Social  History  (Alcohol  and  Smoking  History) 

Medical  History 

Other  Malignancy 

-  Treatments  (Surgery,  Chemotherapy,  Radiotherapy  and  Other  Treatments) 
Clinical  Staging 

Follow-up 

2)  The  Revitalization’s  pathological  module  collects  primary  and  metastasis  general 
data,  their  histology,  and  tumor  features,  which  can  be  used  by  our  tool  to 
automatically  determine  the  cancer’s  staging.  The  tissue  bank  module  of  frozen 
(tissue,  blood  and  pleural)  and  paraffin  keep  the  tissue  storage  location  and  the 
concentration,  volume  and  quality  of  the  DNA,  RNA  and  protein.  The  module 
contains  the  following  web  forms: 

Primary  and  Metastasis  Data  (Diagnosis  and  Surgery  Specimens) 

Histology 

Staging  and  Tumor  Information:  Cancer  staging  (TNM  classification)  is 
automatically  determined  by  the  system  based  on  the  tumor  information 
provided. 

Tissue  Bank  (Frozen  Tissue  and  Paraffin) 

3)  Reports:  Several  Excel  reports  are  provided  for  the  clinical  and  pathological 
modules. 

Clinical  Report 
Pathological  Report 
Patient  Report 
Accession  Report 
General  Information  Report 
Other  Malignancy  Report 
Surgery  Report 
Chemotherapy  Report 
Radiotherapy  Report 
Other  T reatment  Report 
Staging  Report 
Follow-up  Report 
Histology  Diagnosis  Report 
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4)  Dictionaries:  The  database  gives  control  for  the  users  to  conveniently  add  and 
update  dictionaries;  however,  dictionary  deletion  is  prohibited  for  users  to  prevent 
data  loss.  The  database  access  is  only  allowed  for  the  database  administrators. 

The  Revitalization  database  application  has  been  constantly  updated  to  assist  users  in  the  data 
entering  process.  The  retrospective  data  is  checked  and  imported  into  the  database  system, 
while  some  new  data  has  been  entering  by  the  users.  The  database  is  maintained  and  data  is 
preserved  by  the  database  administrator.  Selected  screen  shots  are  provided  in  the  Appendix  1 . 

Major  efforts  are  underway  to  support  the  ReVITALization  effort.  The  database  is  fully 
functional  and  has  captured  54  patients  registered  in  the  trial  as  of  December  2008.  The 
updating  of  clinical  follow-up  data  for  the  retrospectively  enrolled  patients  is  in  progress. 

In  addition,  we  have  continued  to  work  on  statistical  methods  for  evaluating  interactions  for 
combination  therapy  to  determine  whether  the  effects  are  synergistic,  additive,  or  antagonistic. 
One  manuscript  has  been  published  in  Biometrics  and  one  other  is  in  press  in  Statistics  in 
Biopharmaceutical  Research.  The  related  S-PLUS  codes  are  available  for  download  from 
http://biostatistics.mdanderson.org/SoftwareDownload/. 

Conclusion 

The  Biostatistics  and  Data  Management  Core  continues  to  provide  the  essential  statistical  and 
data  management  support  for  the  entire  VITAL  project.  As  the  project  is  coming  to  the  end  of 
the  funding  period,  more  efforts  will  be  shift  to  data  analysis,  interpretation,  and  report  writing. 


Core  C:  Pathology  and  Specimen  Procurement  Core 

(Core  Director:  Ignacio  Wistuba,  M.D.) 

Aim  1.  Develop  and  maintain  a  repository  of  tissue  and  other  biologic  specimens  from 
patients  enrolled  on  the  clinical  trials  in  Project  1. 

Summary  of  Research  Findings 

In  the  past  year,  we  have  continued  enrolling  patients  in  the  VITAL/Vanguard  trials  (Project  1). 
Up  to  December  2008,  53  patients  have  been  enrolled  and  currently  31  patients  have 
completed  the  baseline  and  12-month  bronchoscopy.  From  these  patients,  we  have  acquired, 
processed  and  banked  a  total  of  1,451  specimens  obtained  during  bronchoscopies  (Table  1).  In 
addition,  resected  specimens  from  the  majority  of  lung  cancer  and  head/neck  tumors  have  been 
reviewed  and  banked. 
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Table  1 .  Summary  of  specimens  collected  and  banked  in  the  Pathology  Core 


Type  of  Specimen 

Number 

Sputum 

96 

Buccal  Brush 

96 

Bronchial  Brush 

581 

Bronchial  Wash 

97 

Tissue  Specimens 

581 

Total 

1,451 

Aim  2.  Maintain  a  comprehensive  database  of  tissue  and  specimen  characteristics  from 
patients  enrolled  in  the  clinical  trials  of  Project  1,  including  pathologic  characteristics  of 
each  specimen,  inventory  and  distribution. 

Summary  of  Research  Findings 

The  Biostatistics  Core  has  developed  a  web-based  database  that  has  been  used  to  catalogue 
all  the  specimens  obtained  and  banked  in  the  Pathology  Core  and  to  report  pathology  diagnosis 
(see  previous  Core  report).  From  the  Vanguard  patients,  870  cytological  specimens  and  581 
bronchial  biopsies  have  been  tracked  and  inventoried  using  the  Web-site  database.  As  LIFE 
bronchoscopy  biopsies  have  been  performed  on  these  patients,  LIFE  abnormalities  have  been 
banked  in  a  database  and  will  be  correlated  with  histopathological  features  at  a  later  time. 

Aim  3.  Provide  comprehensive  pathologic  characterization  of  all  tissues  and  other 
biologic  specimens  and  assist  in  preparation  and  evaluation  of  studies  involving  these 
tissues. 

Summary  of  Research  Findings 

We  have  processed  and  histopathologically  diagnosed  581  tissue  specimens  from 
bronchoscopies  (Table  2)  with  two  H&E-stained  tissue  sections  examined  per  bronchial  biopsy. 
Although  normal  bronchial  epithelium  has  been  detected  in  at  least  one  site  examined  in  most 
subjects,  a  number  of  histopathological  changes  have  been  detected  following  tissue  analysis. 
The  most  frequent  abnormalities  detected  were  goblet  cell  metaplasia  (6.3%)  and  basal  cell 
hyperplasia  (17.9%).  Of  interest,  squamous  metaplasia  (3.4%)  and  dysplasia  (1.3%)  were 
infrequently  detected.  No  severe  dysplasia,  carcinoma  in  situ,  or  invasive  carcinoma  have  been 
detected  in  these  specimens.  Histopathological  analysis  of  the  resected  lung  and  head/neck 
cancer  samples  has  been  also  been  performed,  and  tissue  blocks  from  all  these  samples  are 
available  for  biomarker  analysis. 

Table  2.  Summary  of  the  1,231  histopathology  diagnoses  made  in  581  bronchial  biopsies 


obtained  from  clinical  trial  (Project  1 

i). 

Diagnosis 

N 

% 

No  Tissue/Denuded  Epithelium 

27 

2.2 

Normal  Epithelium 

848 

68.9 

Goblet  Cell  Metaplasia  (GCM) 

78 

6.3 

Basal  Cell  Hyperplasia  (BCH) 

208 

16.9 

Combined  GCM/BCH 

12 

1.0 

Squamous  Metaplasia 

42 

3.4 

Angiogenic  Squamous  Dysplasia  (ASD) 

2 

0.2 

Mild  Dysplasia 

8 

0.6 

Moderate  Dysplasia 

6 

0.5 
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Aim  4.  Provide  centralized  immunohistochemistry  and  laser  capture  microdissection 
services,  nucleic  acid  extractions  and  assistance  with  construction  and  evaluation  of 
tissue  arrays. 

Summary  of  Research  Findings 

As  reported  previously,  a  centralized  immunohistochemistry  (IHC)  laboratory  is  in  place  as  part 
of  the  Pathology  Core  with  manual  and  automated  IHC  techniques  and  in  situ  tissue-based 
methodologies,  such  as  fluorescent  in  situ  hybridization  (FISH)  and  laser  capture 
microdissection.  Tissue  microarray  (TMA)  construction  is  also  in  place,  and  a  complete  set  of 
lung  cancer  specimens  (N  =  400,  TMA  set  I)  and  preneoplastic  lesions  (preneoplasia  TMA)  has 
been  constructed  and  utilized. 

As  reported  previously,  from  a  subset  of  early  lung  cancer  surgically  resected  in  our  Institution 
from  1997  to  2002,  we  have  identified  and  selected  504  histologically  normal  and  abnormal 
bronchial,  bronchiolar  and  alveolar  epithelium  specimens  to  be  examined  for  molecular  marker 
expression  using  IHC;  all  these  specimens  have  been  placed  in  TMAs.  We  have  examined  31 
IHC  markers  (Table  3),  most  of  them  related  to  potential  targets  for  targeted-chemoprevention 
strategies.  During  the  previous  year,  we  have  examined  and  identified  5  additional  IHC  markers 
in  these  preneoplasia  specimens  (Table  3). 

Table  3.  Summary  of  immunohistochemistry  (IHC)  markers  examined  in  respiratory  epithelium 
specimens  placed  in  TMAs. _ 


Immunohistochemistry  Markers  Examined  in  Respiratory  Epithelium 

E-Cadherin  (Project  2) 

FGFR2  (Project  2) 

Caspase-8  (Project  2) 

Syndecan-1  (Project  2) 

STAT-5  (Project  2) 

Survivin  (Project  4) 

P70S6K  (Project  2) 

SCC-1  (Project  3) 

NF-kB 

SCC-2  (Project  3) 

IRAK-1  (Project  2) 

pCREB  (Project  3) 

IGF1  (Project  4) 

CREB  (Project  3) 

IGFR1  (Project  4) 

PTTG 

p-IGFRI  (Project  4)* 

Integrin-  (36 

IGF2  (Project  4)* 

N-cadherin 

EGFR  (Project  4) 

Vimentin 

p-EGFR  (Project  4)* 

(3-catenin 

Caveolin-1 

FUS-1 

bFGF  (Project  2) 

MM  P-9 

FGFR1  (Project  2) 

FEN1  (Project  2)* 

UBE2C  (Project  2)* 

*  Markers  examined  during  last  year  period 


We  examined  the  role  of  the  expression  of  several  pathways  and  multiple  markers  in  the  early 
pathogenesis  of  lung  cancer  using  both  tumor  and  preneoplasia  TMAs.  Last  year,  we  published 
the  characterization  of  the  tumor  suppressor  gene  FUS-1  (1 )  and,  in  collaboration  with  Project  2, 
the  transcription  factor  CREB  (2)  in  NSCLC  tumors  and  preneoplastic  lesions.  In  addition,  we 
submitted  two  articles  for  publication  describing  the  role  of  estrogen  and  progesterone  receptors 
(Raso  et  al.,  submitted  to  Clin  Cancer  Res)  and  epithelial-mesenchymal  transition  (EMT) 
phenomenon  (Prudkin  et  al.,  submitted  to  Mod  Pathol)  in  the  early  pathogenesis  for  lung  cancer. 
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Last  year,  we  published  our  data  on  the  sequence  of  molecular  events  involving  EGFR  gene  in 
the  early  pathogenesis  and  progression  of  EGFR- mutant  lung  adenocarcinomas  (3)  (Fig.  1).  We 
demonstrated  that  EGFR  mutations  and  protein  overexpression  are  early  phenomena  in  the 
pathogenesis  of  lung  adenocarcinoma  and  that  EGFR  mutation  precedes  an  increase  in  gene 
copy  number.  In  EGFR-mutant  adenocarcinoma  metastases,  the  higher  levels  of  EGFR 
overexpression  and  more  homogeneously  distributed  high  gene  copy  numbers  suggest  tumor 

progression.  Our  findings  have 
important  implications  for  the 
development  of  new  strategies  for 
targeted  chemoprevention  and 
therapy  in  lung  adenocarcinoma 
using  EGFR  inhibitors.  We  have 
recently  demonstrated  that  the 
EGFR  T790M  mutation, 
associated  with  resistance  to 
treatment  with  EGFR  small 
molecule  inhibitors,  is  present  in 
tumors  not  exposed  to  such 
treatment  and  may  represent  an 
early  phenomenon  in  the 
pathogenesis  of  EGFR  (T790M)- 
mutant  tumors  (4). 

Finally,  our  studies  on  the  role  of 
the  oncogene  TITF1  gene  (5)  in 
the  pathogenesis  of  NSCLC 
showed  that  TITF1  is  amplified  in 
both  major  types  of  NSCLC, 
adenocarcinomas  and  squamous 
cell  carcinoma  (18%)  (Tang.et  al., 
presented  in  the  AACR  Annual 
Meeting  2008,  and  manuscript  in  preparation),  and  in  adenocarcinoma  patients  the  combination 
of  EGFR  and  TITF1  amplification  correlate  with  disease  outcome  (Tang  et  al.,  submitted  to 
AACR  Annual  Meeting  2009,  and  manuscript  in  preparation). 

Aim  5.  Identify  ~600  surgically  resected  tissue  specimens  from  stages  I/ll  NSCLC  (tumor, 
normal  and  abnormal  adjacent  bronchial  epithelium  specimens)  and  their  complete 
clinical  and  pathologic  information. 

Summary  of  Research  Findings 

As  part  of  the  re-VITALization  plan,  we  proposed  to  identify  tumor  and  adjacent  normal 
epithelium  and  preneoplastic  lesions  from  678  NSCLCs  to  be  part  of  the  TMA  set  II.  We  have 
completed  the  review  of  over  700  stages  I  to  MIA  lung  cancer  surgically  resected  specimens  in 
our  Institution  from  January  2003  to  December  2005  and  obtained  archival  formalin-fixed 
paraffin-embedded  specimens  to  identify  tumor  and  adjacent  normal  or  preneoplastic  bronchial 
and  bronchiolar  epithelia.  Of  these,  678  cases  have  been  fully  characterized  pathologically  and 
TMA  construction  has  been  completed. 

The  detailed  pathological  information  included  the  following:  a)  tumor  histopathology:  histology 
type  (Table  4),  percentage  of  histology  sub-type  for  adenocarcinomas  (bronchioloalveolar, 
acinar,  papillary,  solid  and  micropapillary),  grade  of  differentiation,  necrosis,  fibrosis  and 


Figure  1 .  Proposed  sequence  of  EGFR  abnormalities 
occurring  in  the  early  pathogenesis  and  progression  of 
EGFR-mutant  lung  adenocarcinomas.  Normal  bronchial 
epithelium  (NBE)  field,  primary  tumor,  and  metastasis  sites.  Small 
circles,  NBE,  which  acquires  EGFR  mutations  and  EGFR  protein 
(total  and  phosphorylated)  overexpression  (gray  circles).  In  the 
primary  tumor  stage,  the  EGFR  copy  number  increases  (high 
polysomy  and  gene  amplification)  in  small  tumor  foci  (striped  ovals). 
In  the  metastasis  site,  tumor  cells  show  both  EGFR  mutation  and 
high  copy  number  throughout  most  of  the  lesion. 
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inflammation;  b)  tumor  gross  features:  type  of  surgical  resection,  lobule  location,  number  of 
nodules,  size,  location  (central  vs.  periphery),  invasion  (pleural,  mediastinum,  chest  wall, 
vascular,  neural  and  bronchial),  and  surgical  margin.  This  detailed  pathological  analysis  has 
been  recently  expanded  to  the  400  NSCLC  cases  included  in  the  TMA  set  I. 

Table  4.  Histology  diagnosis  of  the  678  NSCLC  identified  for  TMA  construction  and  marker 
analysis. 


By  reviewing  the  clinical  charts,  detailed  demographic  and  clinical  information  has  been 
obtained  in  all  the  cases,  including  follow-up  information  for  adjuvant  therapy,  secondary  tumor 
development,  recurrence,  metastasis  pattern  and  survival.  All  the  pathological  and  clinical  data 
has  been  entered  in  database  modules  developed  by  the  Biostatistic  Core  (Fig.  2).  Currently, 
research  personnel  from  Project  1  are  initiating  the  process  of  contacting  the  alive  patients  to 
update  the  follow-up. 

Aim  6.  Examine  over  40  biomarkers  in  those  specimens  by  immunohistochemical  (IHC) 
and  tissue  microarrays  (TMAs). 

Summary  of  Research  Findings 

TMA  construction.  Nineteen  TMAs  have  been  constructed  using  triplicate  tissue  cores  (1  mm 
diameter)  from  550  NSCLCs.  Quality  control  of  each  tissue  core  (total  1 .650  cores)  has  been 
completed  and  the  TMAs  histology  sections  have  been  sectioned  for  IHC  analysis. 

Immunohistochemistry  (IHC)  analysis.  Using  the  experience  obtained  by  analyzing  nearly 
130  markers  in  the  NSCLC  TMA  set  I,  we  have  initiated  the  IHC  analysis  of  the  Re-VITALization 
TMA.  Currently,  in  collaboration  with  Project  5  (P.I.:  Dr.  H-Y  Lee),  we  are  using  IHC  to  examine 
the  markers  IGFR1  and  p-IGFRI  in  TMAs,  and  we  are  optimizing  IGBP-3  and  p-SRC.  We  have 
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requested  an  unfunded,  one-year  extension  to  complete  the  IHC  analysis  of  the  remaining  36 
markers.  The  IHC  markers  to  be  examined  will  be: 

a)  Markers  obtained  from  among  differentially  expressed  genes  discovered  by  the  analysis  of 
normal  human  bronchial  epithelial  cells  and  premalignant,  transformed  and  tumorigenic  cell 
lines  derived  from  them  (Project  2,  J.  Minna,  L.  Mao,  and  R.  Lotan)  -  6  markers,  some 
oncogene  and  TSGs  abnormalities 

b)  Differentially  expressed  genes  in  abnormal  areas  of  the  bronchial  epithelium  identified  LIFE- 
bronchoscopy  (Project  3,  P.  Koo  and  W.  Hittelman)  -  6  markers 

c)  Death  receptor  (DR)-mediated  apoptosis  (Project  4,  F.  Khuri)  -  6  markers 

d)  Akt  and  IGF/IGFR  pathways  markers  (Project  5,  H-Y  Lee)  -  8  markers 

e)  Markers  obtained  from  the  gene  profiling  analysis  performed  in  50-60  VANGUARD  cases’ 
specimens  (Project  3,  L.  Mao)  -  6  markers 

Key  Research  Accomplishments 

•  Processed  and  diagnosed  1 ,450  bronchoscopy  specimens  from  53  patients  enrolled  in 
the  Vanguard  trial  (Project  1 ). 

•  Completed  the  TMA  construction  and  clinico-pathological  data  collection  of  678 
surgically  resected  NSCLC,  and  initiated  the  IHC  analysis  for  outcome  risk-model. 

•  Characterized  the  sequence  of  molecular  events  involved  in  the  early  pathogenesis  and 
progression  of  EGFR-mutant  lung  adenocarcinomas. 

•  Characterized  TITF1  gene  amplification  in  NSCLCs  and  demonstrated  that  gene 
amplification  occurs  in  squamous  cell  carcinomas  of  the  lung. 

Conclusion 

We  have  acquired  and  banked  1,450  specimens  from  bronchoscopies,  resected  specimens 
from  lung  cancer  and  head/neck  tumor  patients,  and  used  the  database  developed  by  the 
Biostatistics  Core  to  track  and  inventory  bronchoscopy  specimens  and  report  histopathological 
features  of  581  bronchial  mucosa  tissue  specimens.  We  have  examined  by 
immunohistochemistry  (IHC)  31  molecular  markers  in  a  large  series  (N=504)  of  respiratory 
epithelium  specimens  that  have  been  used  to  examine  several  IHC  markers  related  to  VITAL- 
related  projects.  We  have  developed  a  working  hypothesis  for  molecular  events  involved  in  the 
early  pathogenesis  of  EGFR-mutant  lung  adenocarcinomas.  We  have  completed  the 
construction  of  tissue  microarrays  (TMA)  and  the  collection  of  detailed  clinico-pathological  data 
from  678  NSCLCs,  and  initiated  the  molecular  markers’  analysis  by  IHC.  During  the  one-year 
extension,  we  will  complete  the  IHC  analysis  of  this  TMA  set  including  40  markers. 
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KEY  RESEARCH  ACCOMPLISHMENTS  (IN  SUMMARY) 

Project  1 :  Biologic  Approaches  for  Adjuvant  T reatment  of  Aerodigestive  T ract  Cancer 

•  Enrolled  9  more  patients  in  the  Vanguard  study  with  two  additional  patients  pending,  for 
a  total  of  55. 

•  Continued  to  collect  patient  clinical  data  and  tissues  for  distribution  to  support  research 
projects  in  the  VITAL  grant. 

•  31  patients  will  have  completed  the  12-month  mark/bronchoscopy. 

•  ReVITALization  plan  is  being  implemented  with  data  entry  into  the  pathology  database 
related  to  the  500  archived  tissue  specimens,  and  updating  of  the  clinical  database  for 
these  patients. 

Project  2:  Identification  of  Biomarkers  of  Response  to  Chemoprevention  Agents  in 
Lung  Epithelium 

•  Generated  HBECS  from  over  45  different  individuals  and  immortalized  10  peripheral 
small  airway  epithelial  cells  (HSAECs). 

•  Testing  “preneoplastic”  HSAECs  with  chemoprevention  agents  and  performing 
microarray  mRNA  expression  profiles  and  array-based  CGH  copy  number  profiles. 

•  Compared  104  cancer  lines  representing  diverse  tissue  origins  and  discovered  an 
unexpected  frequency  of  activated  genes  in  amplification  'hotspots.' 

•  Using  qRT  PCR  technology  on  RNA  isolated  from  microdissected  tumors  and  normal 
lung  epithelium  from  30  patients,  found  nuclear  hormone  receptors  whose  expression 
differed  between  tumor  and  normal  lung  from  the  same  patient,  and  between  different 
lung  cancers. 

•  Discovered  that  BRF2  was  amplified  and  overexpressed  in  preneoplastic  carcinoma  in 
situ  (CIS)  lesions  in  the  airway  epithelium  of  lung  cancer  patients 

•  Differentially  expressed  genes  such  as  UBE2C,  MCM2,  FEN1,  and  BIRC5  are  located 
within  gene  interaction  networks  significantly  modulated  between  normal,  immortalized, 
transformed  and  tumorigenic  cells. 

•  The  findings  suggest  that  changes  in  MCM2,  SFN,  TPX2,  FEN,  and  UBE2C  genes  are 
all  important  for  the  replication  of  immortalized,  transformed  and  tumorigenic  cells. 

•  UBE2C  protein  level  is  undetectable  in  normal  appearing  lung  epithelium,  but  increases 
in  aberrantly  proliferating  lesions  and  premalignant  lesions.  In  addition,  there  was  an 
increased  expression  in  adenocarcinoma  compared  to  normal  epithelium. 

•  In  lung  cancer  patients  with  stage  I  NSCLC,  both  overall  survival  and  progression-free 
survival  were  statistically  significantly  lower  in  those  cases  where  UBE2C  expression 
was  positive. 

•  Successfully  amplified  and  hybridized  138  samples  using  Affymetrix  ST1.0  Human  gene 
chips. 

Project  3:  Premalignant  Bronchial  Epithelia:  Molecular  and  Cellular  Characterization  of 
Lung  Tumorigenesis 

•  Completed  processing  of  14  pair  of  patient  samples  and  obtained  the  microarray  data 
sets. 

•  Identified  two  distinct  populations  among  patient  samples,  which  has  helped  us  to  avoid 
the  confounding  effect  of  a  mixed  sample  population  and  enabled  us  to  find  consensus 
in  gene  regulation  patterns. 

•  Identified  the  cytokine-cytokine  receptor  pathway  as  one  of  the  major  signal  pathways 
that  may  dictate  the  differences  between  the  white-light  or  LIFE  abnormal  tissue  and 
normal  tissue,  and  could  be  critical  in  the  early  development  of  lung  cancer. 
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•  Located  panels  of  genes  that  are  unanimously  regulated  in  each  distinct  sample  cluster, 
and  found  that  several  of  these  genes  coincide  with  genes  previously  reported  to  be 
important  for  tumorigenesis. 

•  Demonstrated  that  cells  further  along  the  multi-step  tumorigenesis  pathway  tended  to 
divide  more  frequently  away  from  the  basement  membrane  and  their  mitotic  spindle 
orientation  relative  to  the  plane  of  the  basal  layer  was  highly  variable. 

•  Confirmed  that  as  cultures  fashioned  toward  the  tumor  phenotype,  they  more  frequently 
exhibited  mitosis  away  from  the  basal  layer  and  the  orientation  angle  of  the  mitotic 
spindle  (and  degree  of  mitotic  error)  increased  with  distance  from  the  basal  layer. 

•  KG501,  a  small  molecule  inhibitor  targeting  CREB  activity,  and  neutralizing  antibodies 
against  CXCL5  and  CXCR2  blocked  the  migration  of  vascular  endothelial  cells  induced 
by  inflammatory  cytokine. 

•  Demonstrated  that  EGFR  ligands  induce  hyperplasia  of  bronchial  cells  cultured  in  an 
organotypic  3-dimensional  culture  method. 

•  Demonstrated  that  cyclin  D1  upregulated  by  API  transcription  factor  plays  a  critical  role 
in  the  hyperplasia  of  the  bronchial  epithelial  cells  and  demonstrated  that  erlotinib  blocks 
the  hyperplasia. 

Project  4:  Modulation  of  Death  Receptor-Mediated  Apoptosis  for  Chemoprevention 

•  c-FLIP  is  overexpressed  primarily  in  transformed  malignant  lung  epithelial  cell  lines  and 
may  serve  as  a  promising  therapeutic  target  for  prevention  and  therapy  of  lung  cancer. 

•  The  synthetic  atypical  retinoid  SHetA2  induces  apoptosis  and  enhances  TRAIL-induced 
apoptosis  involving  up-regulation  of  DR5  and  down-regulation  of  c-FLIP. 

•  CDDO-Me-induced  up-regulation  of  death  receptor  5  is  coupled  with  endoplasmic 
reticulum  (ER)  stress  via  a  CHOP-dependent  mechanism  involving  JNK  activation. 

•  Perifosine  inhibits  Akt/mTOR  signaling  and  subsequently  induces  autophagy.  Inhibition 
of  this  autophagy  augments  perifosine’s  apoptosis-inducing  activity,  thus  suggesting  a 
therapeutic  strategy  of  combining  perifosine  with  an  autophagy  inhibitor. 

Project  5:  Molecular  Strategies  Targeting  the  AKT  Signaling  Pathway  for  Lung  Cancer 
Chemoprevention  and  Therapy 

•  IGF-1  and  IGF-2  levels  in  bronchial  tissue  specimens  containing  high-grade  dysplasia 
were  significantly  higher  than  in  those  containing  normal  epithelium,  hyperplasia,  and 
squamous  metaplasia. 

•  Derivatives  of  human  bronchial  epithelial  cell  lines  with  activation  mutation  in  KRAS 
(VI 2)  or  loss  of  p53  (genetic  changes  frequently  observed  during  lung  carcinogenesis) 
overexpressed  IGF-1  and  IGF-2.  Tobacco  carcinogen  (TC)  4-(methylnitrosamino)-1-(3- 
pyridyl)-1-butanone  (NNK)  enhanced  transformed  characteristics  of  these  cells,  which 
were  significantly  suppressed  by  inactivating  IGFR. 

•  NNK  and  nicotine  induced  survivin  protein  synthesis  in  normal  HBE  cells  by  activating 
the  Akt/mTOR  pathway. 

Core  B:  Biostatistics  &  Data  Management  Core 

•  ReVITALization  database  has  been  developed  to  extend  the  VITAL  database  providing 
additional  tissue  clinical  and  pathological  data  repositories  and  tissue  tracking. 

•  The  SQL  Server  2005  database  and  ASP.NET  web  application  were  used  with  VB.net 
language  to  provide  the  database  infrastructure.  Queries  and  SQL  2005  reports  were 
implemented.  Secure  Socket  Layer  (SSL)  and  secured  database  password  were  applied 
to  keep  data  transaction  protected  and  confidential. 
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•  Database  is  fully  functional  and  has  captured  54  patients  registered  in  the  trial  as  of 
December  2008. 

Core  C:  Pathology  and  Specimen  Procurement  Core 

•  Processed  and  diagnosed  1 ,450  bronchoscopy  specimens  from  53  patients  enrolled  in 
the  Vanguard  trial  (Project  1 ). 

•  Completed  the  TMA  construction  and  clinico-pathological  data  collection  of  678 
surgically  resected  NSCLC,  and  initiated  the  IHC  analysis  for  outcome  risk-model. 

•  Characterized  the  sequence  of  molecular  events  involved  in  the  early  pathogenesis  and 
progression  of  EGFR-mutant  lung  adenocarcinomas. 

•  Characterized  TITF1  gene  amplification  in  NSCLCs  and  demonstrated  that  gene 
amplification  occurs  in  squamous  cell  carcinomas  of  the  lung. 
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CONCLUSIONS 

Project  1:  The  ultimate  goal  is  the  development  of  a  risk  model  for  development  of  SPT  and 
recurrence.  This  model  will  be  developed  utilizing  the  resources  from  both  the  Vanguard  trial 
and  our  Lung  Cancer  Tissue  Bank  as  a  top  priority  for  our  program. 

Project  2:  Amplification  is  a  far  more  common  mechanism  of  oncogene  activation  than 
previously  believed  and  specific  regions  of  the  genome  are  hotspots  of  amplification.  These 
amplicons  provide  important  new  biomarkers  that  may  be  present  in  preneoplastic  lesions  and 
could  be  promising  targets  for  the  monitoring  and  chemoprevention  of  lung  cancer.  Nuclear 
hormone  receptors  (NRs)  may  provide  prognostic  value  in  lung  cancer  patients,  and  we  would 
use  these  NRs  expression  profiles  to  hormonally  manipulate  lung  cancer  or  as  chemoprevention 
targets.  Genetic  alteration  of  BRF2  represents  a  novel  mechanism  of  lung  tumorigenesis 
through  the  increase  of  Pol  Ill-mediated  transcription  in  cancer. 

Differentially  expressed  genes  (e.g.,  UBE2C,  MCM2,  FEN1,  and  BIRC5)  function  in  cell  cycle 
control  and  display  significant  interactions  with  each  other  and  with  other  known  cancer-related 
genes  in  significant  topological  gene  networks.  Patient  survival  and  progression-free  survival 
were  statistically  significantly  lower  in  those  cases  where  UBE2C  expression  was  positive. 

Project  3:  Cytokine-cytokine  receptor  pathway  was  identified  as  one  of  the  major  signal 
pathways  that  may  dictate  the  differences  between  the  white-light  or  LIFE  abnormal  tissue  and 
normal  tissue,  and  hence  could  play  a  critical  role  in  the  early  development  of  lung  cancer. 
Studies  demonstrated  that  re-regulating  the  spatial  location  of  mitotic  events  in  the  bronchial 
epithelium  may  decrease  the  rate  of  ongoing  genetic  instability  and  perhaps  delay  cancer  onset 
in  individuals  with  increased  lung  cancer  risk. 

KG501,  a  small  molecule  inhibitor  targeting  CREB  activity,  and  neutralizing  antibodies  against 
CXCL5  and  CXCR2  hold  promise  as  preventive  and/or  a  therapeutic  strategy  for  lung  cancer. 
EGFR  ligands  induce  bronchial  hyperplasia,  which  can  be  inhibited  by  erlotinib. 

Project  4:  Appropriate  modulation  of  the  extrinsic  death  receptor-mediated  apoptotic  pathway 
such  as  upregulation  of  DR5  and/or  reduction  of  c-FLIP  levels  by  small  molecules  may  eliminate 
premalignant  or  malignant  lung  epithelial  cells  via  promoting  apoptotic  cell  death  to  achieve 
cancer  chemopreventive  and  therapeutic  goals. 

The  potential  of  the  modulation  of  DR5  and  c-FLIP  as  predictive  biomarkers  for  certain  drugs  in 
the  clinic  warrants  further  investigation.  Inhibition  of  autophagy  may  be  a  good  strategy  to 
augment  certain  anticancer  agents’  anticancer  activity. 

Project  5:  Our  results  demonstrate  that  airway  epithelial  cells  produce  IGFs  in  an  autocrine 
manner,  and  these  IGFs  act  jointly  with  TCs  to  stimulate  lung  carcinogenesis.  Thus,  the  use  of 
selective  IGF-1R  inhibitors  may  be  a  rational  approach  to  controlling  lung  cancer.  Blockade  of 
the  pathway  effectively  inhibits  the  tobacco-induced  malignant  transformation  of  HBE  cells. 

Core  B:  The  Biostatistics  and  Data  Management  Core  continues  to  provide  the  essential 
statistical  and  data  management  support  for  the  entire  VITAL  project.  As  the  project  is  coming 
to  the  end  of  the  funding  period,  more  efforts  will  be  shift  to  data  analysis,  interpretation,  and 
report  writing. 
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Core  C:  We  have  acquired  and  banked  1,450  specimens  from  bronchoscopies,  resected 
specimens  from  lung  cancer  and  head/neck  tumor  patients,  and  used  the  database  developed 
by  the  Biostatistics  Core  to  track  and  inventory  bronchoscopy  specimens  and  report 
histopathological  features  of  581  bronchial  mucosa  tissue  specimens.  We  have  examined  by 
immunohistochemistry  (IHC)  31  molecular  markers  in  a  large  series  (N=504)  of  respiratory 
epithelium  specimens  which  have  been  used  to  examine  several  IHC  markers  related  to  VITAL- 
related  projects.  We  have  developed  a  working  hypothesis  for  molecular  events  involved  in  the 
early  pathogenesis  of  EGFR- mutant  lung  adenocarcinomas.  We  have  completed  the 
construction  of  tissue  microarrays  (TMA)  and  the  collection  of  detailed  clinico-pathological  data 
from  678  NSCLCs,  and  initiated  the  molecular  markers’  analysis  by  IHC.  During  the  one  year 
extension,  we  will  complete  the  IHC  analysis  of  this  TMA  set  including  40  markers. 
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2)  ReVITALization’s  clinical  module:  Other  Malignancy 
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3)  ReVITALization’s  clinical  module:  Treatment:  Surgery,  Chemotherapy, 
Radiotherapy  and  Other  Treatments. 
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4)  ReVITALization’s  clinical  module:  Staging 
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5)  ReVITALization’s  clinical  module:  Follow  up 
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6)  ReVITALization’s  pathological  module:  Tissue  Pathological  Data 
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7)  ReVITALization’s  pathological  module:  Histology 


-7- 


Army  Award  W81XWH-04-1-0142;  Waun  Ki  Hong,  M.D. 
Annual  Report: 


Appendix  1 

Biostatistics  and  Data  management  Core 


8)  ReVITALization’s  pathological  module:  Staging  and  Tumor  Information 
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9)  ReVITALization’s  pathological  module:  Tissue  Bank  (Frozen  and  Paraffin) 
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10)  ReVITALization’s  dictionary  module 
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1 1 )  The  example  of  the  ReVITALization’s  Excel  reports. 
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12)  Vanguard  Study:  Status  Snapshot 
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13)  Vanguard  Study:  Sample  Collection 
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14)  Vanguard  Study:  Monthly  Accrual 
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Deregulated  EGFR  Signaling  during  Lung  Cancer  Progression:  Mutations, 
Amplicons,  and  Autocrine  Loops 


Adi  F.  Gazdar  and  John  D.  Minna 

One  or  more  members  of  the  family  of  epidermal  growth 
factor  receptor  (EGFR)  genes  are  overexpressed  or  otherwise 
deregulated  in  virtually  all  epithelial  tumors,  including  non¬ 
small  cell  lung  cancers  (NSCLC).  This  and  related  observa¬ 
tions  on  the  importance  of  protein  phosphorylation  and  the 
discovery  that  the  first  identified  oncogene,  v-Src,  is  a  protein 
kinase  led  John  Mendelsohn  and  Gordon  Sato  to  select  EGFR 
as  the  first  target  of  molecular  targeted  therapy  more  than 
20  years  ago  (1,  2).  EGFR  family  members  are  deregulated 
in  cancers  by  the  following  three  fundamental  mechanisms: 
activating  gene  mutations,  increased  gene  copy  number  (via 
amplification  or  polysomy),  and  altered  ligand  expression 
(with  possible  formation  of  autocrine  loops;  ref.  3).  Two  re¬ 
ports  in  this  issue  of  the  journal  advance  our  understanding 
of  the  role  of  all  three  mechanisms  in  the  pathogenesis  and 
progression  of  NSCLC  (4,  5).  Before  discussing  these  reports, 
however,  we  will  present  background  information  on  EGFR 
signaling  and  its  deregulation  in  cancers. 

Reversible  protein  phosphorylation  as  a  crucial  regulator 
of  many  essential  cell  functions  has  been  elucidated  over 
the  past  50  years.  A  superfamily  of  more  than  500  highly 
conserved  protein  kinase  genes  contains  about  2%  of  the  gen¬ 
ome  (6).  Specific  kinases  phosphorylate  serine /threonine  or 
tyrosine  residues  or  have  dual  specificity.  The  tyrosine  ki¬ 
nases,  which  catalyze  the  transfer  of  y  phosphate  of  ATP 
to  tyrosine  residues  on  protein  substrates,  fall  into  two 
classes:  transmembrane  receptors  (receptor  tyrosine  kinase) 
and  nonreceptors.  Subclass  I  of  the  receptor  tyrosine  kinases 
is  the  EGFR  family,  which  consists  of  four  members:  EGFR 
(or  EGFR1,  ERBB2,  HER1),  EGFR2  (or  ERBB2r  HER2),  EGFR3 
(or  ERBB3,  HER3)r  and  EGFR4  (or  ERBB4 ,  HER4 ;  ref.  3). 
Receptor-ligand  interaction  results  in  formation  of  homodi¬ 
mers  or  heterodimers  (between  family  members),  activation 
of  the  intrinsic  kinase  domain,  and  phosphorylation  of  speci¬ 
fic  tyrosine  residues  in  the  cytoplasmic  tail  of  the  receptor. 
The  phosphorylated  residues  become  docking  sites  for  multi¬ 
ple  proteins,  which  in  turn  activate  downstream  signaling 
pathways  including  the  PI3K/AKT  prosurvival,  STAT  tran¬ 
scription,  and  RAS/RAF/MEK  proliferation  pathways. 

Eleven  members  of  the  EGF  family  have  been  identified  as 
ligands  for  the  EGFR  family.  HER2  is  not  ligand  activated  be- 
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cause  of  its  unique  extracellular  spatial  structure  but  is  the  pre¬ 
ferred  dimerization  partner  for  other  family  members;  its 
heterodimers  preferentially  enhance  ligand  binding  (7). 
EGFR3  is  "kinase  dead"  (i.e.,  it  lacks  intrinsic  kinase  activity) 
and,  as  with  HER2,  functions  via  heterodimerization.  The  EGF 
ligands  show  specificity  for  multiple  homodimers  or  heterodi¬ 
mers  (7).  Epiregulin  is  a  pan-EGFR  family  ligand  that  prefer¬ 
entially  activates  heterodimeric  receptor  complexes  (8).  The 
EGF  ligands  are  produced  as  transmembrane  precursors  that 
are  cleaved  into  their  soluble  forms  by  proteases  ("shed- 
dases")  of  the  ADAM  family  (especially  ADAM10  and 
AD  AMI  7)  or  by  matrix  metalloproteinases,  a  process  known 
as  ectodomain  shedding  (9).  Other  receptor  pathways  also 
may  activate  EGFR  signaling  by  activating  the  EGFR  pathway 
via  "cross  talk"  and/or  "transactivation."  An  important  new 
example  of  this  with  relevance  to  EGFR  is  the  inflammatory 
cytokine  interleukin-6,  which  activates  the  Janus-activated 
kinase/ signal  transducer  and  activator  of  transcription  sys¬ 
tem,  which  in  turn  activates  EGFR  pathway  signaling.  High 
levels  of  interleukin-6  have  been  described  in  many  cancers, 
including  EGFR-mutant  lung  cancers,  providing  an  additional 
method  for  EGFR  activation  and  a  new  therapeutic  target. 

NSCLC  cells  can  produce  and  release  several  of  the  EGF 
ligands  (10-12).  Under  certain  circumstances,  the  mem¬ 
brane-anchored  isoforms  and  soluble  growth  factors  also 
may  act  as  biologically  active  ligands.  Therefore,  depending 
on  the  circumstances,  these  ligands  may  induce  juxtacrine, 
autocrine,  paracrine,  and/or  endocrine  signaling  (13).  Estab¬ 
lishing  EGFR  autocrine  loops  renders  the  cells  sensitive  to 
inhibition  by  tyrosine  kinase  inhibitors  (10,  12).  Zhou  et  al. 
(14)  described  the  presence  of  an  autocrine  heregulin-EGFR3 
loop  associated  with  up-regulation  of  the  sheddase 
ADAM10.  Inhibiting  ADAM10  with  a  specific  inhibitor  pre¬ 
vented  the  processing  and  activation  of  multiple  EGF  li¬ 
gands.  Recent  reports  indicate  that  breast  and  NSCLC  cells 
(especially  those  with  EGFR  mutations)  may  produce  large 
amounts  of  interleukin-6,  activating  another  autocrine  loop 
that  drives  tumorigenesis  (15,  16). 

Mutations  of  EGFR  may  target  many  regions  of  the  gene, 
especially  the  extracellular  domain  in  glioblastomas  (17)  and 
the  kinase  domain  in  lung  cancers  (18,  19).  EGFR  mutations 
may  play  a  major  role  in  lung  tumorigenesis  but  also  leave 
lung  tumor  cells  dependent  on  EGFR  signaling  pathway  acti¬ 
vation  for  growth  and  survival  ("oncogene  addiction";  refs. 
19,  20).  Therefore,  inhibition  of  EGFR  signaling  by  tyrosine  ki¬ 
nase  inhibitors  rapidly  leads  to  apoptosis  and  growth  cessa¬ 
tion.  In  the  4  years  since  the  discovery  of  the  mutations, 
however,  it  was  realized  that  primary  tumor  response  and  re¬ 
sistance  to  tyrosine  kinase  inhibitors  are  influenced  by  many 
factors,  including  mutations,  mutation  type,  and  copy  num¬ 
bers  of  EGFR ;  EGFR3  activation;  KRAS  mutations;  MET  ampli¬ 
fication,  and  others  (21-23).  Therefore,  although  some  studies 
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(usually  from  single  institutions  analyzing  highly  selected 
patient  populations)  have  shown  very  high  response  rates 
of  EGFR-mutant  tumors  to  tyrosine  kinase  inhibitors,  large 
multi-institutional  clinical  trials  have  often  failed  to  show  a 
survival  benefit  of  this  approach,  although  increased  copy 
number  of  EGFR  (and  HER2  in  some  series)  was  associated 
with  a  good  treatment  outcome  (24,  25).  Although  EGFR 
mutations  and  copy  number  gains  may  occur  independently, 
they  occur  together  more  frequently  than  alone  (26,  27).  In 
addition,  as  with  glioblastomas  (17),  the  mutant  allele  is  pre¬ 
ferentially  amplified  in  such  cases  (26).  Therefore,  "triple 
whammy"  tumors  (i.e.,  those  with  mutations,  copy  number 
gains,  and  mutant  allele-specific  amplifications)  are  in  all 
probability  highly  oncogene  addicted  and  likely  to  show  dra¬ 
matic  and  sustained  responses  to  appropriate  targeted  thera¬ 
pies.  Autocrine  loops  and  other  derangements  of  EGFR 
signaling  are  frequent  in  all  forms  of  NSCLC,  which  therefore 
may  involve  tumors  with  more  than  three  EGFR  aberrations, 
or  "multiple  whammy"  tumors. 

The  finding  that  all  of  these  different  mechanisms  activate 
EGFR  signaling  in  lung  cancers  signifies  the  presence  and 
great  importance  of  strong  selective  pressures  on  the  EGFR 
signaling  pathway  in  these  cancers.  This  selectivity  was  dra¬ 
matically  highlighted  by  the  finding  of  EGFR  tyrosine  kinase 
domain  mutations,  but  lung  cancer  use  of  all  these  alternative 
mechanisms  is  equally  important  in  underscoring  the  key  role 
of  the  EGFR  pathway  in  driving  lung  cancer  pathogenesis.  Of 
course,  these  findings  also  highlight  how  versatile  tumor  cells 
are  in  finding  ways  to  activate  the  pathway.  On  a  related  note, 
the  relapse  and  subsequent  drug  resistance  of  lung  cancers 
that  had  responded  to  EGFR-targeting  drugs  (such  as  EGFR 
tyrosine  kinase  inhibitors)  show  the  resourcefulness  of  these 
cancers  in  finding  other  ways  to  use  the  EGFR  or  other  path¬ 
ways  (e.g.,  KRAS ,  c-MET)  to  ward  off  extinction.  Relapse  and 
resistance  also  highlight  the  need  for  tools  that  can  determine 
whether  the  pathway  is  active  in  and  identify  "sensitive"  ther¬ 
apeutic  target(s)  for  individual  lung  cancers.  It  is  also  impor¬ 
tant  to  realize  that  the  target  is  constantly  changing,  and  thus 
different  therapeutic  options  are  needed  at  different  disease 
stages. 

We  now  evaluate  the  contributions  of  the  articles  by  Zhang 
et  al.  (4)  and  Tang  et  al.  (5)  in  the  context  of  the  EGFR  signal¬ 
ing  background  detailed  above.  Lung  cancer  has  a  high  mor¬ 
tality  that  usually  is  due  to  the  development  of  metastatic 
lesions.  Although  relatively  few  studies  have  directly  com¬ 
pared  the  molecular  changes  in  primary  tumors  with  those 
in  corresponding  metastatic  tumors,  the  metastatic  phenotype 
is  characterized  by  changes  in  multiple  cellular  pathways  (28). 
The  study  by  Zhang  et  al.  (4)  was  stimulated  by  previous 
work  from  their  laboratory  showing  that  epiregulin  is  one  of 
the  several  highly  expressed  EGF  ligands  in  EGFR-mutant 
NSCLC  cells  (10).  This  group  tested  the  hypothesis  that  epir¬ 
egulin  is  involved  in  the  development  of  the  metastatic  phe¬ 
notype.  Immunostaining  studies  confirmed  their  previous 
observation  that  primary  NSCLC  tumors  with  localized  dis¬ 
ease  stages  frequently  (in  65%  of  cases)  expressed  the  ligand. 
They  reported  a  significant  correlation  between  ligand  expres¬ 
sion  and  advanced  nodal  stage  (stage  II)  and  a  trend  toward 
shorter  survival.  In  vitro  studies  confirmed  the  role  of  epiregu¬ 
lin  in  promoting  tumor  growth  and  invasion.  These  analyses 
show  a  clear  role  for  epiregulin  in  tumor  cell  survival,  inva¬ 


sion,  and  metastasis.  Because  epiregulin  can  stimulate  multi¬ 
ple  members  of  the  EGFR  receptor  family,  activation  of  both 
EGFR  and  EGFR3  signaling  may  contribute  to  carcinogenesis. 
Because  ligand  expression  is  much  more  frequent  than  are 
EGFR  mutations  or  copy  number  gains,  these  findings  provide 
further  evidence  that  autocrine  loops  may  be  the  major 
mechanism  by  which  EGRF  signaling  is  deregulated  in  all  his¬ 
tologic  forms  of  NSCLC.  Future  studies  should  comprehen¬ 
sively  analyze  all  11  EGF  ligands  found  in  lung  cancers 
because  other  members  of  this  ligand  group  may  have  similar 
tumor-promoting  actions. 

As  mentioned  earlier,  EGFR  mutations  and  copy  gains  occur 
frequently  in  the  same  tumors.  Previous  studies  have  shown 
widespread  field  effects  throughout  the  respiratory  epithelium 
of  smokers  (29, 30),  suggesting  that  tobacco  exposure  damages 
the  entire  respiratory  epithelium.  Most  EGFR  mutations  occur 
in  lung  cancers  of  lifetime  never  smokers,  which  have  a  largely 
unknown  etiology  (31).  In  their  earlier  work,  the  authors  care¬ 
fully  microdissected  histologically  normal  respiratory  epithe¬ 
lium  from  small  airways  surrounding  mutation-containing 
tumors  (32);  often  present  in  airways  within  or  near  the  tumor 
but  seldom  in  distant  sites,  the  mutations  reflected  a  limited 
field  effect.  Therefore,  exposure  and  damage  seem  to  be  much 
more  limited  in  never  smokers  than  in  current  or  former 
smokers.  In  their  present  study,  Tang  et  al.  conducted  a  more 
extensive  field  study,  assessing  the  presence  of  mutations  and 
copy  number  gains  (by  fluorescence  in  situ  hybridization 
technique)  in  primary  NSCLC,  corresponding  metastases, 
and  histologically  normal  respiratory  epithelium.  As  in  their 
previous  study,  mutations  and  EGFR  protein  overexpression 
were  a  localized  field  effect.  The  key  present  findings  are  that 
copy  number  gains  were  absent  in  normal  epithelium  and 
were  distributed  heterogeneously  in  primary  tumors  and  more 
evenly  in  metastases.  Tang  et  al.  (5)  have  answered  the  ques¬ 
tion,  "Which  came  first,  the  chicken  (copy  number  gains)  or 
the  egg  (mutations)?"  The  finding  of  mutant  allele-specific 
gains  gives  the  nod  to  the  egg. 

The  prototype  EGFR  gene  is  not  the  only  EGFR  pathway 
gene  amplified  in  NSCLC.  A  recent  report  describes  amplifica¬ 
tion  of  other  pathway  members  including  HER2,  SHC1,  and 
AKT  (33).  Our  unpublished  work  indicates  that  other  pathway 
genes  including  KRAS  and  BRAE  may  also  be  amplified  in 
NSCLC.  Although  mutations  of  pathway  genes  are  usually 
mutually  exclusive,  single  tumors  may  contain  copy  number 
gains  for  multiple  genes  or  a  single  pathway  mutation  and  one 
or  more  pathway  gene  copy  number  gains.1 

Two  other  recently  published  studies  (34,  35)  are  consistent 
with  the  findings  of  Tang  et  al.  (5).  Cancers  arise  as  a  result  of 
multistage  processes,  and  a  lesion  known  as  atypical  adeno¬ 
matous  hyperplasia  is  recognized  as  a  precursor  or  premalig- 
nant  lesion  for  peripheral  lung  adenocarcinomas.  Atypical 
adenomatous  hyperplasia  lesions  progress  to  noninvasive 
cancers  known  as  bronchioloalveolar  carcinomas  as  defined 
by  the  strict  criteria  of  WHO  classification  (36).  Bronchioloal¬ 
veolar  carcinoma  tumors  may  become  invasive  and  eventually 
metastatic.  Early  invasive  cancers  may  contain  invasive  and 
noninvasive  components  that  can  be  microdissected  and  ex¬ 
amined  separately.  By  examining  the  various  stages  of  lung 


1  Unpublished  data. 
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pathogenesis  for  EGFR  mutations  and  copy  gains,  both  re¬ 
ports  (34,  35)  conclude  that  mutations  are  early,  preinvasive 
changes,  whereas  copy  number  gains  are  later  events  asso¬ 
ciated  with  the  invasive  phenotype  (Fig.  1). 

All  of  these  findings  are  consistent  with  the  hypothesis  that 
mutations  precede  copy  number  gains,  which  may  be  asso¬ 
ciated  with  the  metastatic  phenotype.  Therefore,  mutations 
are  likely  to  show  little  or  no  heterogeneity  in  primary  or 
metastatic  tumors,  and  copy  number  gains  may  be  absent  or 


heterogeneously  distributed  in  primary  tumors  and  relatively 
evenly  distributed  within  metastatic  sites.  Further  work  will 
be  needed  to  confirm  that  copy  number  gains  are  part  of  the 
metastatic  phenotype. 

What  are  the  clinical  implications  of  these  findings?  The 
data  of  Zhang  et  al.  (4)  suggest  that  about  two  thirds  of  all 
NSCLCs  express  at  least  one  of  the  EGF  ligands.  Testing  the 
expression  of  the  other  10  known  ligands  in  this  cohort  pre¬ 
sumably  would  have  shown  an  even  higher  percentage.  The 


AAH  (preneoplastic) 


BAC  (preinvasive) 


Invasive  adenocarcinoma 


B 


EGFR  Mutations 


Histologically 

normal 


Preneoplastic 

(AAH) 


Preinvasive 

(BAC) 


Invasive 


Metastatic 


EGFR  amplifications 


Epiregulin  loops 


Fig.  1.  Deregulation  of  the  EGFR  gene  during  the  multistage  pathogenesis  of  peripheral  lung  adenocarcinomas.  A,  peripheral  adenocarcinomas  are  believed  to  arise 
from  preneoplastic  lesions  known  as  atypical  adenomatous  hyperplasias  [AAH),  which  first  progress  to  a  preinvasive  neoplastic  stage  called  bronchioloalveolar 
carcinoma  [BAC).  Foci  of  invasion  may  develop  in  the  fibrotic  centers  of  bronchioloalveolar  carcinomas,  which  then  are  called  invasive  adenocarcinomas,  although 
noninvasive  elements  may  persist  at  the  edges  of  the  tumors.  Metastases  ultimately  develop  (not  shown).  B,  from  the  article  by  Tang  et  al.  (5)  and  from  the  literature 
cited  in  the  text,  EGFR  mutations  commence  early  during  pathogenesis  and  can  be  detected  in  histologically  normal  respiratory  epithelium  near  tumors  (localized  field 
effect).  Mutations  are  more  frequent  in  preneoplastic  (atypical  adenomatous  hyperplasia)  and  preinvasive  (bronchioloalveolar  carcinoma)  stages  than  in  normal 
epithelium.  Therefore,  there  is  relatively  little  heterogeneity  of  mutations  in  invasive  carcinomas,  and  the  mutations  contribute  to  tumor  pathogenesis.  In  contrast,  gene 
copy  number  gains,  often  in  the  form  of  amplifications,  commence  relatively  late  in  pathogenesis,  usually  at  the  tumor  stage.  They  are  more  frequent  in  metastatic 
lesions,  suggesting  that  they  may  be  progression  events  involved  in  the  metastatic  phenotype.  Much  less  is  known  about  the  timing  of  epiregulin  loops  (either 
autocrine,  paracrine,  or  juxtacrine).  From  the  data  of  Zhang  et  al.  (4),  however,  it  would  seem  that  epiregulin  loops  can  be  detected  in  primary  invasive  tumors  but  are 
more  frequent  or  active  during  the  metastatic  stage.  The  dashed  line  indicates  that  the  timing  of  the  appearance  of  these  loops  during  earlier  preinvasive  stages  is 
unknown. 
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expression  of  EGFR  protein  in  most  NSCLCs,  including  squa¬ 
mous  cell  carcinomas,  raises  the  question  of  what  mechanism 
causes  deregulation.  Mutations  and  copy  number  gains 
explain  only  a  minority  of  these  cases  and  probably  are  not 
important  mechanisms  in  squamous  cell  carcinomas.  As  sug¬ 
gested  by  the  data  of  Zhang  et  al.  (4),  activation  of  autocrine 
(or  paracrine  or  juxtacrine)  loops  is  an  attractive  alternative 
mechanism.  If  this  loop  is  dependent  on  continued  EGFR  sig¬ 
naling  and  is  inhibited  by  tyrosine  kinase  inhibitor  therapy,  as 
suggested  by  the  data,  this  would  be  a  plausible  explanation 
for  why  some  nonmutant  tumors  of  all  histologic  types  with 
nearly  diploid  copy  number  respond  to  tyrosine  kinase  inhibi¬ 
tor  therapy  (24,  37).  Future  retrospective  and  prospective  stu¬ 
dies  are  needed  to  determine  whether  EGF  ligand  expression 
is  an  additional  predictive  factor  for  tyrosine  kinase  inhibitor 
response.  The  concept  that  the  driving  force  behind  many  or 
most  NSCLC  tumors  is  EGF  ligand  receptor  loops  offers  the 
clinician  the  following  additional  avenues  for  potential  tar¬ 
geted  therapies:  preventing  sheddase  up-regulation  or  activ¬ 
ity,  preventing  ligand  production  directly  or  by  inhibition  of 
the  loop  at  a  more  upstream  stage,  targeting  the  soluble  form 
of  the  ligand,  and  preventing  ligand-receptor  interaction. 

With  the  identification  of  deregulated  expression  of  EGF 
family  ligands  in  lung  cancer  pathogenesis,  we  can  now 
consider  using  the  relevant  ligands  for  early  cancer  diagno¬ 
sis,  identifying  key  therapeutic  targets,  and  as  biomarkers  to 
monitor  response  to  chemoprevention  or  very  early  treat¬ 
ment.  Because  the  ligands  are  soluble,  they  potentially  could 
be  detected  in  blood  or  bronchial  lavage  specimens  in  addi¬ 
tion  to  biopsy  and  brushing  specimens.  Furthermore,  while 
exploring  their  diagnostic  and  therapeutic  targeting  roles, 
we  need  to  understand  the  molecular  mechanisms  leading 
to  the  deregulated  expression  of  these  ligands.  Copy  num¬ 
ber  changes,  mutations,  promoter  alterations  (including  epi¬ 
genetic  changes),  the  role  of  specific  transcription  factors 


(such  as  the  lineage-specific  oncogene  TITF1),  and  altered 
miRNA  expression  are  all  potential  mechanisms  that  need 
to  be  explored,  as  does  ligand  expression  in  cancer  stem 
cells. 

Another  major  clinical  interest  is  to  understand  the  sequen¬ 
tial  appearance  of  molecular  changes  during  multistage 
pathogenesis.  The  appearance  of  EGFR  mutations  at  a  prein- 
vasive  and  even  at  a  premalignant  phase  creates  opportu¬ 
nities  to  use  EGFR  mutation  markers  for  risk  identification, 
early  detection,  and  prevention,  particularly  for  never  smo¬ 
kers,  who  are  at  most  risk  for  EGFR-mutant  tumors  and 
for  whom  no  such  markers  currently  exist  (31).  Early  EGFR 
mutations  also  have  important  implications  for  the  study  of 
EGFR  tyrosine  kinase  inhibitors  in  the  adjuvant/second  pri¬ 
mary  tumor  prevention  setting.  Whereas  mutations  seem  to 
be  initiating  events,  copy  number  gains  are  related  to  pro¬ 
gression  and  metastatic  events.  Therefore,  heterogeneity 
may  occur  both  within  the  primary  tumor  and  between  the 
primary  tumor  and  metastatic  sites.  These  considerations  are 
important  if  copy  number  gains  are  used  as  a  marker  for  se¬ 
lecting  targeted  therapies,  and  they  indicate  the  importance 
of  testing  for  this  marker  in  tumor  samples  obtained  imme¬ 
diately  before  therapy  versus  relying  on  marker  data  from 
earlier  samples. 

The  reports  of  Zhang  et  al.  and  Tang  et  al.  in  this  issue  of  the 
journal  shed  new  light  on  the  highly  complex,  multifaceted, 
and  as  yet  incompletely  understood  nature  of  the  EGFR  sig¬ 
naling  pathway.  This  pathway  in  NSCLCs  and  in  the  bron¬ 
chial  epithelium  of  patients  at  a  high  lung-cancer  risk  will 
be  a  critical  focus  of  diagnostic,  preventive,  and  therapeutic 
efforts  for  the  foreseeable  future. 
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Confidence  Intervals  of  Interaction  Index 
for  Assessing  Multiple  Drug  Interaction 


J.  Jack  Lee  and  Maiying  Kong 


Studies  of  interactions  among  biologically  active 
agents  have  become  increasingly  important  in  many 
branches  of  biomedical  research.  We  consider  that  the 
Loewe  additivity  model  is  one  of  the  best  general  ref¬ 
erence  models  for  defining  drug  interactions.  Based  on 
the  Loewe  additivity  model,  synergy  occurs  when  the 
interaction  index  is  less  than  one,  and  antagonism  oc¬ 
curs  when  interaction  index  is  greater  than  one.  Starting 
from  the  Loewe  additivity  model  and  the  marginal  dose- 
effect  curve  for  each  drug  involved  in  a  combination,  we 
first  present  a  procedure  to  estimate  the  interaction  in¬ 
dex  and  its  associated  confidence  interval  at  a  combina¬ 
tion  dose  with  observed  effects.  Following  Chou  and  Ta- 
lalay’s  method  for  assessing  drug  interaction  based  on 
the  plot  of  interaction  indices  versus  effects  for  combi¬ 
nation  doses  at  a  fixed  ray,  we  then  construct  a  pointwise 
(1  —  a)  x  100%  confidence  bound  for  the  curve  of  interac¬ 
tion  indices  versus  effects.  We  found  that  these  methods 
work  better  on  the  logarithm  transformed  scale  than  on 
the  untransformed  scale  of  the  interaction  index.  We  pro¬ 
vide  simulations  and  case  studies  to  illustrate  the  perfor¬ 
mances  of  these  two  procedures,  and  present  their  pros 
and  cons.  We  also  provide  S-Plus/R  code  to  facilitate  the 
implementation  of  these  two  procedures. 

Key  Words:  Antagonism;  Loewe  additivity  model;  Synergy. 


1.  Introduction 

Studies  of  interactions  among  biologically  active 
agents,  such  as  drugs,  carcinogens,  or  environmental  pol¬ 
lutants,  have  become  increasingly  important  in  many 
branches  of  biomedical  research.  Our  research  group  re¬ 


viewed  the  literature  (Lee,  Kong,  Ayers,  and  Lotan  2007) 
and  agree  with  many  researchers  (e.g.,  Berenbaum  1985, 
1989;  Greco,  Bravo,  and  Parsons  1995)  that  the  Loewe 
additivity  model  should  be  considered  as  the  “gold  stan¬ 
dard”  for  defining  drug  interactions 

For  a  combination  of  k  drugs  ( k  >  2)  at  (d\, . . . ,  <4), 
based  on  the  Loewe  additivity  model,  drug  interactions 
at  this  combination  can  be  characterized  as 


<  1, 

synergy; 

=  1, 

additivity; 

(i) 

.>  1, 

antagonism. 

Here  d\, ...  ,dk  are  doses  of  each  drug  in  the  mix¬ 
ture  of  the  k  drugs  resulting  in  effect  y,  and  Dy\,  . . 
Dy  k  are  the  doses  of  drugs  that  result  in  the  effect  y 
for  each  respective  drug  given  alone.  The  summation, 
tt1 — F  •  •  •  +  ,  is  called  the  interaction  index,  which 

is  denoted  as  z.  Based  on  the  Loewe  additivity  model, 
the  combination  dose  (d\, ... ,  <4)  is  said  to  be  syner¬ 
gistic  if  the  interaction  index  is  less  than  the  constant 
number  of  1,  and  additive  or  antagonistic  if  the  index  is 
equal  to  or  greater  than  1,  respectively.  To  give  an  in¬ 
tuitive  idea  about  the  interaction  index,  we  illustrate  its 
meaning  in  the  special  case  of  k  =  2.  Note  that  the  com¬ 
bination  dose  ( d\ ,  di)  produces  the  same  effect  y  as  drug 
1  alone  at  dose  level  Dy\,  and  drug  2  alone  at  dose  level 
Dy  2,  which  implies  that  1  unit  of  drug  2  will  produce  the 
same  effect  as  units  of  drug  1.  Thus,  the  amount  of 

dose  at  the  combination  ( d\ ,  di)  equals  to  d\  +  di 
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in  terms  of  drug  1  dose.  By  definition,  r  = 

which  implies  that  d\  +  =  tD^i.  r  <  1  implies 

di  +  d2jp^  =  rD-yj  <  Dy  i,  therefore,  the  amount  of 
the  combination  dose  (d\ ,  di)  to  produce  the  same  effect 
y  is  less  than  the  amount  of  dose  when  single  drug  is 
applied,  hence,  indicating  synergy.  The  smaller  r  is,  the 
more  is  the  reduction  of  the  amount  of  dose  in  the  combi¬ 
nation,  and  the  stronger  is  the  synergy.  Similarly,  r  >  1 
implies  that  d\  +  <^27^  =  T^y,l  >  Dy,  1,  that  is,  the 
amount  of  the  combination  dose  (d\ ,  J2)  producing  the 
same  effect  y  is  more  than  each  single  drug  dose,  hence 
indicating  antagonism.  The  geometric  interpretation  of 
the  interaction  index  can  be  best  shown  graphically  in 
Figure  1,  panels  (A)  and  (B),  where  P  =  (Z>y?i,0), 
Q  =  (0,  Dy  2),  U  =  (d\,  d2),  all  yield  the  same  effect  y. 
If  we  draw  a  line  RS  passing  through  U  and  parallel  to 
the  line  PQ,  and  draw  a  line  OU  intercepting  with  P  Q 
at  V ,  then,  from  the  basic  geometric  properties,  the  inter¬ 
action  index  can  be  expressed  as  the  ratio  of  |ength^^;> .  In 
Figure  1(A),  the  closer  the  point  U  is  toward  the  origin, 
the  less  is  the  amount  of  combination  dose  required  to 
produce  the  same  effect  as  drug  1  alone  at  Dy  \  or  drug 
2  alone  at  Dy2,  hence,  the  stronger  is  the  synergy.  By 
the  same  token  in  Figure  1(B),  the  further  the  point  U  is 
away  from  the  origin,  the  larger  is  the  amount  of  combi¬ 
nation  dose  required  to  produce  the  same  effect  as  drug  1 
alone  at  Dy\  or  drug  2  alone  at  Dy2,  hence,  the  stronger 
is  the  antagonism.  From  Figure  1,  we  conclude  that  the 
interaction  index  can  be  used  to  measure  the  mode  and 
magnitude  of  drug  interactions. 

Given  the  combination  dose  (d\, . . . ,  df)  and  its  ef¬ 
fect  y,  and  the  marginal  dose-effect  curves  fi(Di)  for 
drug  i  (i  =  1,  ...,&),  the  calculation  of  the  interaction 
index  at  a  combination  dose  (d\, ... ,  dk)  is  straightfor¬ 
ward.  One  simply  replaces  Dy  j  by  /-_1(y),  where  f~l  is 
the  inverse  function  of  f)  (i  =  1, ...  ,k).  However,  since 
the  dose-effect  curves  are  usually  estimated  and  the  ef¬ 
fect  y  is  observed  with  error,  to  make  valid  inferences 
for  drug  interactions,  one  needs  to  account  for  all  these 
variabilities.  In  other  words,  one  needs  to  consider  the  es¬ 
timated  interaction  indices  along  with  their  variances  to 
make  valid  statistical  inferences  on  drug  interaction. 

In  most  settings,  the  functional  form  of  the  marginal 
dose  response  curves  may  not  be  known  and  need  to  be 
estimated  from  the  data.  Chou  and  Talalay  (1984)  pro¬ 
posed  the  median-effect  equation  which  has  been  widely 
used  to  model  the  dose-effect  curve  with  good  success 
(Chou  2006).  In  our  cell  line  study  (Kong  and  Lee  2006; 
Lee  et  al.  2007),  we  also  find  that  the  median-effect  equa¬ 


tion  fits  the  data  well.  Chou  and  Talalay ’s  median-effect 
equation  has  the  following  form 


where  d  is  the  dose  of  a  drug  eliciting  effect  E ,  Dm  is  the 
median  effective  dose  of  a  drug,  and  m  is  a  slope  parame¬ 
ter  depicting  the  shape  of  the  curve.  When  m  is  negative, 
the  curve  described  by  Equation  (2)  falls  with  increasing 
drug  concentration;  when  m  is  positive,  the  curve  rises 
with  increasing  drug  concentration.  The  median-effect 
Equation  (2)  is  independent  of  the  drug’s  mechanisms  of 
action  and  does  not  require  knowledge  of  conventional 
kinetic  constants  (Chou  2006;  Greco  et  al.  1995).  Under 
the  assumption  that  the  dose-effect  curves  follow  Chou 
and  Talalay ’s  median-effect  equation,  in  Section  3  we  in¬ 
vestigate  the  characteristics  of  the  interaction  index  and 
its  logarithmic  transformation,  and  propose  a  procedure 
to  construct  the  confidence  interval  for  the  estimated  in¬ 
teraction  index. 

Although  interaction  index  can  be  estimated  at  each 
combination  dose  separately,  this  approach  is  not  effi¬ 
cient.  The  result  tends  to  be  more  varying  as  it  depends 
on  only  measurement  at  a  single  combination  dose  level. 
To  gain  efficiency,  one  can  assume  a  model  and  pool  data 
at  various  combination  doses  to  form  a  better  estimate 
of  the  interaction  index.  One  commonly  used  approach 
is  applying  the  ray  design.  Chou  and  Talalay  (1984)  and 
Chou  (1991)  proposed  a  procedure  to  characterize  a 
two-drug  interaction  by  first  fitting  marginal  dose-effect 
curves  and  a  dose-effect  curve  for  the  combination 
doses  with  their  components  at  a  fixed  ray  (i.e.,  d\/d2=a 
constant,  forming  a  ray  in  the  d\  x  J2  dose  plane),  then 
assessing  drug  interaction  based  on  the  plot  of  their  com¬ 
bination  indices  versus  effects  for  combination  doses  at 
this  fixed  ray.  The  confidence  intervals  for  the  combina¬ 
tion  indices  were  constructed  by  Monte  Carlo  techniques 
(CalcuSyn  at  http://www.hiosoft.eom/w/calcusyn.htm ; 
Belen’kii  and  Schinazi  1994;  CompuSyn  at 
http://www.comhosyn.com/).  The  combination  index 
has  the  same  form  as  the  interaction  index  when  the 
combined  drugs  are  mutually  exclusive.  However,  Chou 
and  Talalay ’s  mutual  exclusiveness  and  nonexclusiveness 
criteria  are  difficult  to  evaluate,  and  the  combination  in¬ 
dex  has  been  criticized  by  many  researchers  (Berenbaum 
1989;  Greco  et  al.,  1995).  In  Section  3,  by  adopting  the 
interaction  index  for  Chou  and  Talalay ’s  method,  we 
extend  their  method  to  assess  drug  interactions  among 
k(>  2)  drugs,  and  propose  a  procedure  to  construct  a 
pointwise  (1  —  a)  x  100%  confidence  bound  for  the 
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A.  Synergy  B.  Antagonism 


Figure  L  Illustration  of  the  interaction  index.  P  =  (Dy  0)  represents  the  drug  1  dose  producing  effect  y,  Q  =  (0,  Dy  2)  represents  the  drug  2 
dose  producing  effect  y,  and  U  =  (d\,d2)  represents  the  combination  dose  producing  the  same  effect  y.  RS  is  the  line  passing  by  U  and  parallel 
to  PQ,  and  V  is  the  intercept  of  OU  and  PQ.  The  interaction  index  can  be  expressed  as  the  ratio  of  •  Panel  A  illustrates  the  case  for 


synergy  while  Panel  B  for  antagonism. 


estimated  curve  of  interaction  indices  versus  effects 
by  accounting  for  all  the  variabilities  in  estimating 
the  dose-effect  curves.  In  Section  4,  we  present  the 
results  from  simulations  and  case  studies  to  show  that 
our  proposed  procedure  performs  at  least  as  well  as 
the  Monte  Carlo  procedure  in  terms  of  covering  the 
underlying  curves  and  shortening  the  confidence  bound, 
and  performs  better  in  terms  of  taking  less  time  to 
compute.  The  last  section  is  devoted  to  discussion. 


2.  Interaction  Index  and  its  Confidence 
Interval  at  a  Combination  Dose 

In  this  section,  we  will  present  how  to  construct  a  con¬ 
fidence  interval  for  interaction  index  at  a  combination 
dose  (d\, ... ,  dk )  with  an  effect  y.  Here  the  dose-effect 
curve  for  drug  i  (i  =  l, ...  ,k)  is  estimated  from  the 
marginal  data  with  only  zth  drug  being  applied.  Note  that 
Chou  and  Talalay’s  median-effect  equation  (2)  also  can 
be  rewritten  as 

E 

log  - -  =  m  (log  d  -  log  Dm )  =  /?o  +  pi  log  d,  (3) 

1  —  E 


where  /?o  =  —m  log  Dm  and  f  \  =  m.  The  dose  produc¬ 
ing  effect  E  can  be  written  as  either 


(  E  \m 

d  —  Dm  (  - - -  , 

(4) 

\l  -  E  J 

or 

"““pOC-e)  ■ 

(5) 

Suppose  model  (3)  has  the  form  log  y^ 

=  A,  + 

f  i  log  d  +  c  with  £  following  Af(0,  cr2),  then  we  may 
regress  log  y^—  on  log  d  to  get  the  marginal  dose-effect 

curve  log  y^  =  fo,i  +  fiijlogd  for  drug  i  with 
i  =  1  Meanwhile  we  may  get  the  variances 

and  covariances  for  the  estimates  fio,i  and  fi\j  for  i  = 
l, ...  ,k.  If  the  observed  mean  effect  at  a  combination 
dose  (d\, ... ,  dk)  is  y,  then,  based  on  (5),  the  associated 
interaction  index  can  be  estimated  by 

k  .  k  j 

f  =  Z^  =  Z — — — <«> 

i=1  y'1 

The  simulations  in  Section  4  indicate  that  the  distribution 
of  log(r)  is  approximately  normal,  while  f  deviates  from 
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a  normal  distribution  for  large  o ’s.  Thus,  we  should  ap¬ 
ply  the  delta  method  (Bickel  and  Doksum  2001)  to  log  r 
instead  of  r ,  then  we  take  the  exponential  transformation 
to  get  the  confidence  interval  for  r.  By  applying  delta 
method  to  log(f ),  we  get 


var(log(r)) 

-  var(f) 


„  1 1 

(  di 

di 

di 

di 

di 

“  f2' 

SPi,\ 

’  dkk 

SP\,k 

’  d;y 

xS  j 

^  di 

di 

di 

di 

di' 

- ~ - ?  •  • 

•  5  - Z -  , 

\ 

{SPo,i  ’ 

SPi,\ 

8Po,k 

dkk 

dy , 

,(7) 


where 


di  di  1  di  di  l°g  \-y  0O,i 
dfioj  ~  Dyj  ki  ’  ski  -  Dy,i  fil It 

for  i  =  1 , ,k,  and 

di  _  1  /  1  di  |  |  1  dk  \ 

sy  yO- -  y)\k\  by,\  kkby,k)’ 

li  is  the  variance-covariance  matrix  of  the  2k  param¬ 
eters  (0o,\,  0\,\, . . . ,  0o,k>  0l, k)  and  the  observed  mean 
effect  y  at  (d\, ...  ,dk).  Any  two  pairs  of  parameters, 
(00, i?  0\, i)  and  (0o,j,0l,j)  when  i  /  j  are  indepen¬ 
dent  since  typically,  different  experimental  subjects  was 
used  for  drug  i  alone  and  for  drug  j  alone,  respectively. 
Further,  all  those  subjects  are  different  from  the  subjects 
administrated  the  combination  dose  (d\, ...  ,dk).  Thus, 
the  estimates  (0oj ,  0\j  )  are  independent  of  the  estimates 
(0O,j,0i,j)  when  i  /  j,  and  all  of  them  are  indepen¬ 
dent  of  the  observed  mean  effect  y  at  (d\ , . . . ,  dk).  There¬ 
fore,  £  is  a  block  diagonal  matrix  with  the  block  being  a 
2x2  matrix  except  for  the  last  diagonal  element  var(y). 
An  approximate  variance  of  log(f )  can  be  obtained  by 
var(log(f))  ±2  var(r),  where 


var(f)  ^ 


Ms:)' 


(var(/?0  i)  2cov(/?o,j>  A,(')(l°g  yzj  “  kb 

~k+  ~  % 

var(^i;J)(log  -  kb' 


+ 


*  )  , 

(1  d\  1  4  \ 

P\,\  Dy<  1  fil  J  Dy,k  ) 

(Hikt 


var(y). 


(8) 


We  can  estimate  var(y)  in  two  ways.  When  there  are 
replicates  at  the  combination  dose  (d\,  . . . ,  <4),  var(y) 
can  simply  be  estimated  by  the  sample  variance  at 
(d\,d2).  Otherwise,  we  may  borrow  the  information 
from  estimating  the  marginal  dose-effect  curves.  Note 
that  var(log  y^)  ^  (ypfzyy)2var(y),  thus,  we  may  sub¬ 
stitute  (^jz^y)2var(y)  by  the  average  of  the  squared 
residuals  obtained  from  fitting  the  median-effect  Equa¬ 
tion  (3)  for  all  drugs  involved  assuming  a  constant  vari¬ 
ance  for  both  the  single  and  combination  drug  effects. 
Once  the  variance  is  obtained,  a  (1  —  a)xl00%  confi¬ 
dence  interval  for  log(r)  can  be  formed  as 

[log(r)  -  ?„_2t;|\/var(log(f)), 

log(f)  +  tn_2k  a  >/var(log(f))J  , 

where  tn_2k  «  is  the  1  —  |  percentile  of  t -distribution  with 

n  —  2k  degree  of  freedom,  and  n  =  X/=i  ni  with  ni 
(i  =  l, ...  ,k)  being  the  number  of  observations  when 
drug  i  is  used  alone.  2k  is  the  total  number  of  estimated 
parameters  involved  in  estimating  the  interaction  index 
(6).  Thus,  a  (1  —  a)  x  100%  confidence  interval  for  r  can 
be  constructed  as: 

f  exp  «  \/var(log(r)))  , 

i  exp  Aar(log(f)))]  .  (9) 


When  var(log(f ))  is  small,  we  have 


r  exp 


(±f— at.  §  varOogCf  ») 

-  2J 2  V var(f 


±2  z  exp 


-f  ±f„-2.,|Vvar(f). 


Therefore,  if  the  error  in  (3)  is  small,  the  confidence  in¬ 
terval  for  r  based  on  (9)  is  essentially  the  same  as  the 
confidence  interval  constructed  by  directly  applying  the 
delta  method  to  f ,  which  is 


_f  -  K-2 *,«Vvar(r),  f  VvaKfi]  •  (10) 

In  Section  4,  we  illustrate  that,  for  a  large  error  in  (3), 
the  confidence  interval  (9)  behaves  better  than  (10)  in 
two  aspects:  (i)  the  lower  limit  is  greater  than  zero  all 
the  time;  and  (ii)  the  confidence  interval  has  a  coverage 
rate  that  is  closer  to  the  nominal  rate.  Therefore,  the  con¬ 
fidence  interval  (9)  is  preferred.  When  n  —  2k  is  large,  say 
n  —  2k  >  20,  one  may  use  instead  of  tn_2k  „  in  esti¬ 
mating  the  confidence  intervals  (9)  and  (10),  where  z«  is 
the  1  —  |  percentile  of  the  standard  normal  distribution. 
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3.  Interaction  Indices  and  Their  Confidence 
Bound  at  a  Fixed  Ray 


3.1  Two-Drug  Combination 


Note  that  the  confidence  interval  (9)  is  based  on  a  sin¬ 
gle  observation  and  the  marginal  dose-effect  curves,  and 
the  estimated  interaction  index  and  its  confidence  interval 
are  greatly  influenced  by  this  single  observation.  Chou 
and  Talalay  (1984)  used  a  ray  design  to  assess  drug  in¬ 
teractions.  The  advantage  of  their  method  is  that  it  uses 
all  observations  with  the  component  doses  at  a  fixed  ray. 
We  adopt  the  interaction  index  instead  of  the  combination 
index  when  using  their  approach.  The  basic  idea  (Chou 
1991)  is  to  regress  log  on  log  D  for  each  of  the  two 
drugs  used  alone  and  regress  log  on  log(di  +  J2) 
for  the  combination  doses  (^1,^2)  with  ^  say, 

log  jzrp  =  fo,c  +  f\,c  log  Dc.  Then  for  each  fixed  effect 
y,  one  may  estimate  the  interaction  index  by 


f)  _ 

uy>ccoi+(a2 

Dy,  1 


f)  0J2 

Uy>ca>i+(D2 

Dy,  2 


(ID 


1 

where  Dy,i  =  exp(-|^)  (y^)^1’1  ,  Dy,2  = 

exp(“te)(*)*- and  =  exi,(“fc) 

Commercial  software  CalcuSyn  and  Com- 
puSyn  are  available  for  estimating  the  interaction  indices 
and  their  confidence  intervals.  The  confidence  intervals 
for  interaction  indices  in  (1 1)  were  constructed  based  on 
Monte  Carlo  techniques  and  the  normal  assumption  on 
the  parameters  (Belen’kii  and  Schinazi  1994).  Briefly, 
the  parameters,  (foj,  f\j)  for  i  =  1,  2,  c,  and  the  inter¬ 
action  index  (11)  are  estimated  from  the  observed  data, 
then  certain  number  of  random  samples  (say,  500)  of  the 

parameters,  for  j  =  1,...,500  and  i  = 

1,  2,  c,  are  generated  based  on  their  estimated  values  and 
covariances  under  the  normal  assumption  on  each  pair 
of  these  parameters.  Thus,  500  interaction  indices,  zCT  . 
(j  =  1, ,  500),  can  be  calculated  and  its  standard  devi¬ 
ation  can  be  estimated  as  j  ~  *ct)2- 

Consequently,  the  confidence  interval  can  be  constructed 
as  [zCT  -z«aT,  *ct  +  In  the  simulation  and  case 

studies  in  Section  4,  we  used  t  £  a  instead  of 

ni+n2+nc-6,  2 

since  the  number  of  observations  is  small. 

In  the  following  subsection,  we  extend  Chou  and  Ta- 
lalay’s  method  to  k(>  2)  drugs,  estimate  drug  interaction 
at  a  fixed  ray,  say  d\  :  d^  :  •  •  •  :  dk  =  co\  :  CD2  :  •  •  •  :  cok , 
and  construct  a  (1  —  a)  x  100%  confidence  interval  for  the 
constructed  interaction  index  at  each  effect  y.  Thus,  by 
varying  y,  a  pointwise  confidence  bound  for  the  curve  of 


interaction  indices  versus  effects  with  combination  doses 
at  the  fixed  ray  can  be  constructed  by  using  the  delta 
method. 


3.2  A>drug  combination 

Again,  we  assume  that  the  fitted  dose-effect  curve 
is  log  =  fio ,i  +  P\,i  log  Dt  +  6  for  drug  i  with 
i  =  1, . . . ,  k.  The  fitted  dose-effect  curve  for  the  mix¬ 
ture  with  their  component  doses  at  a  fixed  ray  with  d\  : 
d2  :■■■■■  dk  =  (o\  :  co2  :  •  •  •  :  cok  is  log  = 

+  fil,c  log  Dc  +  e  with  Dc  =  d\  +  d2  +  •  •  •  +  dk. 
Then,  for  each  fixed  effect  y,  one  may  estimate  interac¬ 
tion  index  by 


D 


CO  1 


y>C  CD\Jr...+(Qk 


D 


(Ok 


D 


+  •••  + 


y>C  COl+...+CQk 


D 


y,k 


(12) 


where  Dyj  =  exP  ^°r  *  = 

1,  •  •  • ,  k,  c.  Again,  (jio,i,Pi,i)  and  are  inde¬ 

pendent  as  long  as  i  /  j  for  i,  j  =  1, ... ,  k,  c.  Based 
on  the  delta  method  (Bickel  and  Doksum  2001),  we  can 
obtain  an  approximate  variance  for  zCT 


var(fcr)  = 


/  dzCT  \ 

_i_  ( cy_cj_  \ 

\dby,c) 


var  (Dyj) 


var  (DyiC) 


)  var(4 

“TV  (Thcoi)D2y.J 

,•= i  Dy,i  / 


)  (13) 


with 


var  (Dyj)  = 


.  _  / 8Dy,i  8Dyj  \ 

\  dfio,i  ’  8 Pi  J  ) 


'SDyJ 

dfio  ,i 

|  dDyJ 


D: 


/ _ 1_  Piu  ~  '°g  -pj  \ 

''\  E’  Wj  ) 


i -  ki 
x  *h>Ai  I 

ph 


for  i  =  c,  respectively.  Thus,  replacing 

var (Dyj)  in  (13),  we  can  obtain  the  estimated  variance 
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for  ir 


var(fCT)  = 


■^-1  /  COi  Dy  e  \ 
i= 1  \(X?=1  0)i)by,i  ) 


/ var(/?0  i )  2c°v(Ao ,i  ,h,i) (log  Ty  ~  A),/ ) 


+ 


var(/?l;,)(log  jAy  -  /?o 


/A, 


+ 


/  by£  ( f,J'  Y\ 

vz?=M£r  4,7/ 


(var(/?0  c)  2cov(/?o,c,  A,c)(log  yAy  “  A>,c) 

"1FT+ 


+- 


var(^i;C)(log  -py  -  Po,cj‘ 


At 

) 


(14) 


(d\,  d2).  In  Section  3,  we  have  the  observations  for  com¬ 
bination  doses  at  a  fixed  ray,  then  we  fit  the  dose-effect 
curve  for  this  ray.  We  estimate  the  interaction  index  for 
each  fixed  effect,  where  the  combination  dose  produc¬ 
ing  such  an  effect  is  estimated.  Thus,  the  second  term  in 
(14)  describes  the  uncertainty  contributed  by  the  variance 
of  the  estimated  combination  dose  Dy,c,  which  could  be 
split  into  the  estimated  combination  dose 


4.  Simulations  and  Case  Studies 

4.1  Simulations 

To  examine  whether  the  confidence  intervals  proposed 
in  Sections  2  and  3  have  proper  characteristics,  we  per¬ 
formed  simulations  in  the  following  two  scenarios. 


Here,  we  prefer  the  confidence  interval  based  on  the 
delta  method  on  log(fcr)  since  log(fcr)  is  more  approx¬ 
imately  normally  distributed  than  iCT .  A  (1  —  a)  x  100% 
confidence  interval  for  zCT  can  be  constructed  by 


zCT  exp 


^  "+^  2*  2 ~2  yvar(fcr)^; 


zCT  exp 


(n+nc— 2k— 2,  y  j  ~~  ~  | 

- 7 - VVar(Tcr)  I 

ZCT  / 


(15) 


Again  n  =  2/=i  ni  and  ni  0  =  1, . . . ,  k)  is  the  number 
of  observations  when  drug  i  is  used  alone,  nc  is  the  num¬ 
ber  of  observations  for  the  combination  doses  at  a  fixed 
ray.  By  varying  y  in  different  values,  we  can  construct  a 
pointwise  (1  —  a)  100%  confidence  bound  for  the  curve 
of  interaction  indices  versus  effects.  Thus,  we  can  assess 
drug  interactions  for  combination  doses  at  the  fixed  ray 
while  considering  the  stochastic  uncertainty  in  obtaining 
the  observations. 


Remark:  Comparing  the  variances  of  estimated  inter¬ 
action  indices  in  (8)  and  (14),  we  note  that  the  first  terms 
in  both  equations  are  approximately  the  same,  while  the 
second  terms  are  markedly  different.  In  Sections  2  and  3, 
the  k  dose-effect  curves  for  all  the  drugs  involved  are  es¬ 
timated.  The  first  terms  in  both  equations  describe  the  un¬ 
certainty  contributed  by  estimating  the  k  marginal  dose- 
effect  curves.  In  Section  2,  we  estimate  the  interaction  in¬ 
dex  based  on  the  observed  mean  effect  at  a  single  combi¬ 
nation  dose,  and  oftentimes,  we  assume  the  combination 
dose  is  measured  without  error.  Under  this  setting,  the 
second  term  in  (8)  describes  the  variability  contributed 
by  the  mean  of  the  observed  effects  at  the  combination 


Scenario  1:  three  drugs,  at  a  single  combination 
dose.  In  the  first  scenario,  we  simulated  three  drugs  that 
followed  the  median-effect  Equation  (2)  with  the  same 
slope  m  =  —  1  and  different  median  effective  doses: 
Dm  i  =  1,  Dm2  =  2,  and  Dm3  =  4,  respectively.  We 
took  the  combination  dose  (d\,d2,  d 3)  with  each  com¬ 
ponent  being  one  third  of  its  associated  median  effec¬ 
tive  dose,  that  is,  (d\,  d2,d3)  =  (^p-,  = 

(^,  |).  If  the  combination  dose  is  additive,  the  ex¬ 

pected  effect  will  be  0.5.  Let  us  denote  the  interaction 
index  at  this  combination  dose  as  r,  the  effect  as  E ,  then 
based  on  Equation  (4),  we  have 

d\ _ ^ _ d2 _ ^ _ d3  _  ^ 

Dmi  (yz^)  Dm2  (yz^)  m  Dm3  (yz^)  m 

Thus,  the  effect  at  ( d\  ,d2,d3)  can  be  explicitly  expressed 
as 

_  (T_1  (^7  +  A7  +  s^)) 

i  +  tT-i 

1  ^  \L  \Dm  1  ^  Dm2  ^  Dm3  ) ) 

We  vary  r  among  (0.2,  0.4,  0.6,  0.8,  1,  1.25,  1.67,  2.5,  5). 
The  corresponding  effect  E  will  be  (0.167,  0.286,  0.375, 
0.444, 0.5, 0.556, 0.625, 0.714, 0.833),  respectively.  Note 
that  the  slope  m  is  negative,  so  the  dose-effect  curve  is 
decreasing.  If  the  effect  is  less  than  0.5,  then  the  combi¬ 
nation  dose  will  be  synergistic  and  the  interaction  index 
will  be  less  than  1 ;  whereas  if  the  effect  is  greater  than 
0.5,  the  combination  dose  will  be  antagonistic  and  the  in¬ 
teraction  index  will  be  greater  than  1 .  It  is  obvious  that 
the  farther  the  interaction  index  moves  away  from  1 ,  the 
stronger  is  the  interaction  effect. 
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Table  1.  Simulation  results  for  Scenario  1:  a  fixed  combination  dose  (d\,d,2,  df)  =  (y ,  |)  but  with  varying  interaction  indices.  The  three 

dose-effect  curves  follow  the  median-effect  equation  with  m  =  —  1  and  (Dm Dm2,  Dmf)  =  (1,  2,  4). 
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Under  the  above  setting,  we  first  generated  six  equally 
spaced  doses,  ranging  from  0.1  to  three-fold  of  the  as¬ 
sociated  median  effective  dose  for  each  drug.  We  then 
generated  the  effects  based  on  the  model  log  = 
/?0  +  fix  lo gd  +  e  with  e  ~  N(0,  a2)  for  each  drug, 
where  fio  —  —m  log  Dm  and  fi\  =  m.  We  generated  the 
observed  effect  at  the  combination  dose  (y ,  |)  with 

the  same  size  of  the  stochastic  variation.  The  total  sam¬ 
ple  size  for  each  simulated  experiment  was  19  (six  ob¬ 
servations  for  each  single  drug  and  one  observation  for 
the  combination  dose  effect).  We  fitted  each  dose-effect 
curve  based  on  the  generated  data.  Then,  for  each  r ,  we 
estimated  the  interaction  index  based  on  (6),  constructed 
its  95%  confidence  intervals  based  on  (9)  and  (10),  re¬ 
spectively,  calculated  the  length  of  the  confidence  inter¬ 
vals,  and  counted  whether  the  true  r  lies  in  the  respective 
confidence  intervals  based  on  (9)  and  (10),  and  whether 
the  confidence  interval  based  on  (9)  lies  below  1,  con¬ 
tains  1,  or  lies  above  1.  We  repeated  this  procedure  1,000 
times,  and  averaged  all  the  above  quantities.  We  summa¬ 
rized  the  results  in  Table  1  under  different  settings  for 
a\  a  =  0.1  and  g  —  0.4,  respectively.  From  Table  1, 
we  conclude  that  (a)  the  estimation  for  r  (mean(f)  in 
Table  1)  is  close  to  the  true  value  and  the  accuracy  de¬ 
creases  as  g  increases;  (b)  the  resulting  coverage  rates 
(Cov.rate.log)  based  on  confidence  interval  (9)  are  closer 
to  the  nominal  coverage  rate  of  95%  than  those  (Cov.rate) 
based  on  (10),  particularly,  when  g  is  larger;  (c)  the  aver¬ 
age  lengths  of  the  confidence  intervals  (Len.ci.log)  based 
on  (9)  and  the  average  lengths  of  the  confidence  inter¬ 
vals  (Len.ci)  based  on  (10)  increase  as  g  increases  with 
Len.ci.log  slightly  larger  than  Len.ci  to  provide  the  nom¬ 


inal  coverage  rate;  and  (d)  the  percentage  of  times  the 
model  correctly  assesses  drug  interaction  based  on  (9) 
as  synergy  (Pct.syn.log),  additivity  (Pct.add.log),  or  an¬ 
tagonism  (Pct.ant.log)  decreases  as  g  increases.  For  each 
underlying  interaction  index  among  (0.2,  0.6,  1,  1.67,  5), 
we  obtained  the  Q-Q  plot  of  the  1,000  estimated  interac¬ 
tion  indices  (Figure  2,  Columns  B1  and  B3)  as  well  as 
the  Q-Q  plot  of  the  1,000  logarithms  of  the  estimated  in¬ 
teraction  indices  (Figure  2,  Columns  B2  and  B4)  under 
the  settings  g  =  0.1  and  g  =  0.4,  respectively.  From 
Figure  2,  it  is  clear  that  for  small  g  (e.g.,  g  =  0.1  ),  both 
the  estimated  interaction  index  and  the  logarithm  of  the 
estimated  interaction  index  are  approximately  normally 
distributed.  But,  when  g  becomes  large,  say  g  =  0.4,  the 
estimated  interaction  indices  deviate  from  a  normal  dis¬ 
tribution,  while  the  logarithms  of  the  estimated  interac¬ 
tion  indices  are  still  approximately  normally  distributed. 
Therefore,  one  would  expect  that  the  delta  method  on  the 
logarithm  of  the  interaction  index  would  work  better  for 
constructing  confidence  intervals  for  interaction  indices. 
This  assertion  has  been  verified  by  the  results  of  the  cur¬ 
rent  simulation  studies.  Thus,  we  prefer  using  confidence 
interval  (9)  over  (10)  for  the  interaction  index  in  Section 
2,  and  using  the  confidence  interval  (15)  in  Section  3. 

Scenario  2:  Two  drugs,  with  a  ray  design.  The  sec¬ 
ond  scenario  involves  two  drugs  that  have  the  same  dose- 
effect  curves  as  drug  1  and  drug  2  in  the  first  scenario. 
That  is,  the  two  dose-effect  curves  follow  the  median- 
effect  Equation  (2)  with  the  same  slope  m  =  —  1,  and 
median  effective  doses:  Dm\  =  1  and  Dm2  =  2,  re¬ 
spectively.  We  assume  that  the  dose-effect  curve  for  the 


7 


Statistics  in  Biopharmaceutical  Research:  Vol.  1 ,  No.  1 


A1 :  11=0.2 


B3:  sigma=0.4 


Quantiles  of  Standard  Normal 


B4:  sigma=0.4 


Quantiles  of  Standard  Normal 


A3:  11=1 

Inter.  Index 

i.7  0.9  1.1  1.3 

X 

Log(lnter.  Index) 

-0.3  -0.1  0.1  0.3 

x 

Inter.  Index 

0.5  1.5  2.5 

r 

f 

Log(lnter.  Index) 

.5  -0.5  0.5 

x 

-2  0  2 
Quantiles  of  Standard  Normal 

-262 

Quantiles  of  Standard  Normal 

-2  0  2 
Quantiles  of  Standard  Normal 

-262 

Quantiles  of  Standard  Normal 

A4:  11=1.67 

Inter.  Index 

1.2  1.6  2.0  2.‘ 

Log(lnter.  Index) 

0.2  0.4  0.6  0.8 

X 

Inter.  Index 

2  4  6 

f 

/ 

Log(lnter.  Index) 

-0.5  0.5  1.5 

-2  0  2 
Quantiles  of  Standard  Normal 

-262 

Quantiles  of  Standard  Normal 

-2  0  2 
Quantiles  of  Standard  Normal 

-2  0  2 
Quantiles  of  Standard  Normal 

A5:  11=5 

Inter.  Index 

4.0  5.0  6.0 

X 

Log(lnter.  Index) 

1.3  1.5  1.7  1.9 

x 

Inter.  Index 

5  10  15  20  2‘ 

t 

* 

Log(lnter.  Index) 

0.5  1.5  2.5 

-2  0  2 
Quantiles  of  Standard  Normal 

-262 

Quantiles  of  Standard  Normal 

-2  0  2 
Quantiles  of  Standard  Normal 

-2  0  2 
Quantiles  of  Standard  Normal 

Figure  2.  Q-Q  plots  of  estimated  interaction  indices  (Column  B1  and  B3)  and  Q-Q  plots  of  the  logarithms  of  the  estimated  interaction  indices 
(Column  B2  and  B4)  for  1000  samples  under  Scenario  1.  Columns  B1  and  B2  show  the  Q-Q  plots  under  o  =0.1,  and  Column  B3  and  B4  show 
the  Q-Q  plots  under  o  =  0.4. 


combination  doses  (di,  £fe)  at  the  fixed  ray,  say,  ^  =  y, 
follows  the  median-effect  Equation  (2)  with  Dm  12  =  1.5 
and  myi  —  —2. 

We  generated  five  equally  spaced  doses,  ranging  from 
0.1  to  three-fold  of  the  associated  median  effective  dose 
for  each  of  the  single  drug,  and  five  equally  spaced  doses, 
ranging  from  0.5  to  three-fold  of  the  associated  median 
effective  dose  for  the  mixture  (d\ ,  J2)  at  the  fixed  ray 
with  ^  =  y  anc*  with  the  dose  in  the  median  effect  Equa¬ 
tion  (2)  being  d\  +  ^2-  We  then  generated  the  effects  as¬ 
sociated  with  these  generated  doses  based  on  the  model 
log  \~p  =  A)  +  /?i  log d  +  £  with  £  ~  Af(0,  a2)  for  the 
two  drugs  and  their  mixture  under  the  settings  a  =  0.2 
and  a  =0.4,  respectively.  The  total  sample  size  for  each 
simulated  experiment  is  15  (five  observations  for  each 


single  drug  and  five  observations  for  combination  doses). 
In  addition,  under  each  setting  for  a ,  we  generated  seven 
samples  for  illustration.  For  each  sample,  we  first  fitted 
the  dose-effect  curves  for  the  two  drugs  and  the  mix¬ 
ture,  and  then  performed  the  following  steps:  we  (a)  esti¬ 
mated  the  interaction  indices  based  on  (12)  for  42  equally 
spaced  effect  levels  between  the  range  of  0.1  to  0.95; 
(b)  constructed  their  confidence  intervals  based  on  (15) 
and  on  a  Monte  Carlo  simulation  proposed  by  Belen’kii 
and  Schinazi  (1994),  respectively;  and  (c)  estimated  in¬ 
teraction  index  (6)  and  constructed  the  confidence  inter¬ 
val  (9)  for  each  observed  combination  dose.  Figure  3  il¬ 
lustrates  plots  of  the  underlying  curve  of  the  interaction 
index  versus  effect  (solid  line),  the  pointwise  95%  confi¬ 
dence  bound  for  this  curve  based  on  (15)  (dotted  lines), 


Confidence  Intervals  of  Interaction  Index  for  Assessing  Multiple  Drug  Interaction 


the  confidence  bound  based  on  the  Monte  Carlo  simula¬ 
tion  (dashed  lines),  and  the  estimated  interaction  indices 
(dots)  and  their  confidence  intervals  (vertical  bars)  for 
the  observed  doses  based  on  (9)  for  the  seven  illustrative 
samples  under  each  setting  for  a .  From  Figure  3,  we  con¬ 
clude  that  (a)  the  pointwise  95%  confidence  bound  (dot¬ 
ted  lines)  embrace  the  true  curve  (solid  line)  well;  (b)  the 
pointwise  confidence  bounds  based  on  (15)  (dotted  lines) 
are  similar  to  those  based  on  Monte  Carlo  simulations 
(dashed  lines)  when  a  is  small,  but  perform  better  when 
o  is  large;  and  (c)  the  confidence  intervals  based  on  sin¬ 
gle  observations  (vertical  bars)  are  generally  wider  than 
both  confidence  bounds,  and  the  conclusions  based  on 
confidence  bounds  are  more  accurate  than  those  based  on 
the  confidence  intervals  (vertical  bars)  for  single  observa¬ 
tions.  In  addition,  for  each  sample,  we  calculated  the  ratio 
of  the  length  of  the  confidence  interval  based  on  (9)  and 
the  confidence  interval  based  on  Monte  Carlo  simulation 
described  in  Section  3  for  each  of  the  observed  effects  at 
the  combination  doses  at  the  fixed  ray.  Under  o  —  0.2, 
the  ratios  for  the  seven  samples  have  a  mean  2.05  with 
standard  deviation  0.58,  and  range  from  1.12  to  3.28.  Un¬ 
der  a  =0.4,  the  ratios  for  the  seven  samples  have  a  mean 
2.33  with  standard  deviation  0.92,  and  range  from  1.01  to 
4.15  after  removing  an  extreme  of  the  observation  with 
effect  close  to  one  (Figure  3,  Panel  B7).  Therefore,  when 
several  observations  for  combination  doses  at  a  fixed  ray 
are  available,  the  confidence  bound  derived  in  Section  3 
using  more  available  information  is  more  efficient,  thus 
preferred.  In  addition,  when  we  ran  this  simulation  to  get 
the  14  confidence  bounds  in  the  14  panels  in  Figure  3 
by  separately  using  the  confidence  interval  based  on  (15) 
and  Monte  Carlo  procedure  on  an  Intel  1.83  GHz  com¬ 
puter,  the  time  it  took  was  10  seconds  and  17  minutes, 
respectively.  It  is  clear  that  the  calculation  for  the  confi¬ 
dence  bound  based  on  (15)  is  much  faster  than  that  based 
on  Monte  Carlo  simulations. 

4.2  Case  Studies 

Our  research  group  (Lee  et  al.  2007;  Kong  and  Lee 
2006)  investigated  drug  interactions  between  two  novel 
agents,  SCH66336  and  4-HPR,  in  a  number  of  squamous 
cell  carcinoma  cell  lines  (Chun  et  al.  2003).  Here  we 
present  the  dataset  and  results  from  cell  line  UMSCC22B 
in  Table  2  for  investigating  drug  interaction  in  combina¬ 
tion  doses  at  the  fixed  ray  with  ^  =  j. 

We  first  obtained  the  dose-effect  curves  for  SCH66336 
and  4-HPR  by  a  linear  regression  of  log  on  log  d , 
based  on  the  data  in  Table  2.  Recall  that  log  = 
m(\ogd  —  1  ogDm)  =  fib  T  fix  lo gd.  The  estimates  of 
fib,  fi\,  Dm ,  and  d  for  drug  1,  drug  2,  and  the  mixture  of 
the  drugs  with  equal  concentrations  are  summarized  in 
the  same  table. 


The  transformed  data  log  versus  log  d  and  the 
median-effect  plots  are  shown  in  Figure  4(A).  This  me¬ 
dian  effect  plot  indicates  that  the  data  follow  the  median- 
effect  Equation  (2)  reasonably  well.  Based  on  the  fit¬ 
ted  median-effect  equations,  we  calculated  the  interac¬ 
tion  indices  based  on  (12)  for  varied  effects  for  combina¬ 
tion  doses  at  the  fixed  ray  with  ^  =  y  and  constructed 
their  associated  confidence  bounds  based  on  (15)  and  on 
Monte  Carlo  simulations  (Belen’kii  and  Schinazi  1994), 
respectively.  Figure  4(B)  shows  the  plot  of  the  interac¬ 
tion  indices  (on  the  logarithm  scale)  versus  effects  (solid 
line)  for  combination  doses  at  this  fixed  ray  and  the  95% 
pointwise  confidence  bounds  based  on  (15)  (dotted  lines) 
and  on  Monte  Carlo  simulations  (dashed  lines).  Based 
on  the  confidence  bound  (dotted  line),  we  conclude  that 
the  combination  doses  at  the  fixed  ray  with  ^  =  y  with 
effect  less  than  0.52  are  synergistic,  and  the  combina¬ 
tion  doses  at  the  fixed  ray  with  effect  greater  than  0.52 
are  additive.  The  conclusions  based  on  the  confidence 
bounds  obtained  from  Monte  Carlo  simulations  (dashed 
lines)  are  slightly  different.  We  also  calculated  four  inter¬ 
action  indices  based  on  (6)  and  their  confidence  intervals 
based  on  (9)  at  the  four  observed  data  points  {d\,  J2)  as 
being  (0.1,  0.1),  (0.5,  0.5),  (1,  1),  and  (2,  2).  The  four 
interaction  indices  were  0.791,  0.609,  0.256,  and  0.103, 
and  their  corresponding  95%  confidence  intervals  were 
[0.202,  3.091],  [0.169,  2.193],  [0.060, 1.087],  and  [0.018, 
0.581],  respectively.  These  pairs  of  interaction  indices 
versus  effects,  along  with  their  95%  confidence  intervals 
are  shown  as  vertical  bars  in  Figure  4(B).  From  these 
four  interaction  indices  and  their  confidence  intervals,  we 
conclude  that  the  combination  doses  at  the  fixed  ray  with 
^  =  |  are  synergistic  for  doses  >  2  //M  for  each  single 
drug,  and  additive  for  doses  <  1  //M  for  each  single  drug. 
The  conclusions  from  the  two  procedures  in  Section  2 
and  3  are  slightly  different:  the  combination  doses  (1,1) 
and  (0.5,  0.5)  with  respective  observed  effect  0.3551  and 
0.4919  were  identified  as  additive  based  on  the  second 
and  third  vertical  bar  (reading  from  left  to  right),  while 
based  on  the  confidence  bound  (dotted  lines),  the  combi¬ 
nation  doses  were  identified  as  synergistic  as  each  effect 
was  less  than  0.52.  The  relative  length  of  the  confidence 
interval  based  on  (9)  versus  the  monte  Carlo  confidence 
interval  ranges  from  1.27  to  3.72  for  the  four  combination 
doses  at  the  fixed  ray.  Once  again,  this  example  shows 
that  the  confidence  interval  estimation  based  on  a  sin¬ 
gle  observation  (9)  is  not  as  efficient  as  the  correspond¬ 
ing  confidence  interval  based  on  model  (15)  which  used 
more  data. 

We  also  examined  another  dataset  from  a  drug  combi- 
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A  4  :  Effect  E  A  5  :  Effect  E 


A  7  :  Effect  E 


B  6  :  Effect  E 


B  7  :  Effect  E 


Figure  3.  Simulated  results  under  Scenario  2.  The  solid  line  is  the  plot  of  the  underlying  true  interaction  indices  versus  effects  for  combination 
doses  at  the  fixed  ray  with  ^  =  j  ,  the  dotted  lines  and  the  dashed  lines  are  the  95%  pointwise  confidence  bounds  for  the  curve  of  the  interaction 
index  versus  effect  based  on  the  delta  method  and  Monte  Carlo  simulation  under  the  settings  o  =  0.2  (Panels  A)  and  o  =  0.4  (Panel  B), 
respectively,  and  the  dots  and  the  vertical  bars  are  the  estimated  interaction  indices  and  their  confidence  intervals  for  observed  combinations. 


nation  study  for  o-phenanthroline  and  ADP  on  the  inhibi¬ 
tion  of  horse  liver  alcohol  dehydrogenase,  which  was  an¬ 
alyzed  by  Chou  and  Talalay  (1984)  and  by  Belen’kii  and 
Schinazi  (1994).  The  dataset  and  the  estimated  median- 
effect  parameters  are  shown  in  Table  3.  The  median- 
effect  plots  for  the  two  drugs  and  their  mixture  ( d\ ,  J2) 
at  the  fixed  ray  with  ^  are  shown  in  Figure  5(A). 

The  plot  of  interaction  indices  versus  the  fractional  in¬ 
hibitions  at  this  fixed  ray  are  shown  as  a  solid  line  in 
Figure  5(B).  In  the  same  panel,  we  illustrate  the  point- 
wise  confidence  bound  (dotted  lines)  based  on  (15)  and 
the  pointwise  confidence  bound  (dashed  line)  based  on 


Monte  Carlo  simulations  for  this  curve,  and  the  estimated 
interaction  indices  (dots)  and  their  associated  confidence 
intervals  (vertical  bars)  based  on  (9)  for  the  combination 
doses  having  observed  effects.  Again,  the  two  95%  con¬ 
fidence  bounds  are  almost  the  same,  the  vertical  bars  are 
wider  than  the  confidence  bound,  and  the  conclusions  on 
drug  interactions  based  on  vertical  bars  and  those  based 
on  confidence  bounds  are  consistent. 

We  developed  two  S-PLUS/R  programs.  One  is  used 
to  estimate  the  interaction  index  and  its  confidence 
interval  for  a  single  combination  dose  of  multiple 
drugs,  and  the  other  is  used  to  estimate  the  pointwise 
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Table  2.  Fractions  of  squamous  cell  carcinoma  cells  (UMSCC22B)  surviving  after  72  hours  of  treatment  by  single  and  combination  dose  levels 
of  SCH66336  and  4-HPR  and  the  fitted  median-effect  parameters. 


SCH66336 
dose  (/zM) 

4-HPR  dose 
dose  (/iM) 

Fractional 

survival 

Median-effect 

parameters 

0.1 

0.6701 

fi0fl  =  0.094(0.085) 

0.5 

0.6289 

fil’l  =  -0.335(0.066) 

1 

0.5577 

Dm\  =  1.326 

2 

0.4550 

b  i  =  0.187 

4 

0.3755 

0.1 

0.7666 

A), 2  =  0.217(0.073) 

0.5 

0.5833 

Plt2  =  -0.398(0.058) 

1 

0.5706 

Dm2  =  1.726 

2 

0.4934 

<r2  =  0.129 

0.1 

0.1 

0.6539 

A), 12  =  -0.225(0.092) 

0.5 

0.5 

0.4919 

h,Yl  =  -0.596(0.082) 

1 

1 

0.3551 

Dm\  2  =  0.686 

2 

2 

0.2341 

b\2  =  0.182 

Note:  The  number  inside  the  parentheses  in  the  last  column  is  the  standard  error  of  the  estimate. 


A.  Median  Effect  Plots  B.  Inter.  Index  vs.  Effect 


the  two  dashed  lines  are  the  pointwise  95%  confidence  bounds  for  the  curve  of  interaction  index  versus  effect  based  on  the  delta  method  in  Section 
3  and  Monte  Carlo  simulation,  respectively,  and  the  dots  and  the  vertical  bars  are  the  estimated  interaction  indices  and  their  confidence  intervals 
for  observed  combinations.  The  four  vertical  bars  from  left  to  right  correspond  to  the  combination  doses  of  (2,  2),  (1,  1),  (0.5,  0.5),  and  (0.1,  0.1), 
respectively. 
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Table  3.  Inhibition  of  horse  liver  alcohol  dehydrogenase  by  o-phenanthroline  and  ADP  alone  and  in  combination  (Chou  and  Talalay  1984; 
Belen’kii  and  Schinazi  1994). 


Fractional 

Median-effect 

o-phenanthroline 

ADP 

Inhibition 

parameters 

8.7 

0.132 

fio  ti  =  -4.696(0.145) 

17.4 

0.267 

=  1.302(0.046) 

26.1 

0.411 

Dmi  =  36.803 

34.8 

0.476 

<j\  =  0.058 

43.5 

0.548 

0.5 

0.175 

fi0i2  =  -0.601(0.079) 

1.0 

0.400 

Pl,2  =  1.178(0.127) 

1.5 

0.492 

Dm2  —  1-666 

2.0 

0.542 

a2  =0.161 

2.5 

0.592 

9.2x|L4 

9.2  x 

0.507 

A), 12  =  -3.843(0.038) 

18.4  x 

18.4  Xjij 

0.769 

^1,12  =  1-739(0.012) 

v  17.4 

27,6  m 

27.6  x  jj|^ 

0.872 

Dm\ 2  =9.117 

36.8  X  ||| 
46-°XTO 

36.8  x 

46.°  x 

0.919 

0.944 

b\2  =  0.015 

Note:  The  number  inside  the  parentheses  in  the  last  column  is  the  standard  error  of  the  estimate. 


confidence  bound  for  the  curve  of  interaction  index 
versus  effect  for  combination  doses  at  a  fixed  ray.  The 
S-PLUS/R  code  and  the  data  example  are  available  in 
CI-ofJnte r action  Jndex,  which  can  be  downloaded  from 
http:// bio  statistics,  mdanderson.  org/SoftwareDownload/. 

5.  Discussion 

We  proposed  a  procedure  in  Section  2  to  estimate  the 
interaction  index  and  constructed  its  associated  confi¬ 
dence  interval  for  a  multiple  drug  combination.  In  most 
cases,  the  dose-effect  for  a  single  agent  is  known,  and  in¬ 
vestigators  are  interested  in  assessing  whether  drug  com¬ 
binations  are  synergistic.  When  resources  are  limited,  the 
experiment  can  be  conducted  in  only  a  limited  number 
of  combination  doses.  We  can  assess  drug  interactions 
for  those  combination  doses  based  on  the  procedure  pro¬ 
vided  in  Section  2.  Note  that  although  the  dose-effect 
curves  follow  Chou  and  Talalay ’s  median-effect  equation 
work  reasonably  well,  the  model  may  not  work  in  cer¬ 
tain  cases.  In  these  cases,  other  dose-effect  models  must 
be  sought.  For  example,  Lee  et  al.  (2009)  found  that  the 
Emax  model  describes  the  experimental  data  there  bet¬ 
ter  than  Chou  and  Talalay’s  median-effect  equation;  thus, 
the  £max  model  was  used  there.  Upon  finding  the  dose- 
effect  curves  of  any  parametric  form  which  fits  the  data, 
one  may  use  the  same  philosophy  to  estimate  the  interac¬ 
tion  index  and  construct  its  associated  confidence  inter¬ 
val  based  on  the  delta  method.  However,  using  this  “at  a 


combination  dose”  method,  one  can  assess  drug  interac¬ 
tions  only  at  combination  doses  having  observed  effects, 
and  the  drug  interaction  tends  to  be  predicted  as  additiv¬ 
ity  due  to  lack  of  efficiency  (i.e.,  wide  confidence  inter¬ 
vals)  even  with  nonadditive  drug  interactions. 

Chou  and  Talalay’s  method  based  on  a  ray  design  is 
widely  used.  We  provided  a  procedure  to  construct  point- 
wise  confidence  bound  for  Chou  and  Talalay’s  curve  of 
interaction  index  versus  effect  in  Section  3.  The  proce¬ 
dure  we  provided  avoids  extensive  calculations  used  in 
Monte  Carlo  techniques,  which  were  required  in  the  soft¬ 
ware  CalcuSyn  and  CompuSyn  and  in  the  method  pro¬ 
vided  by  Belen’kii  and  Schinazi  (1994).  From  the  sim¬ 
ulations  and  case  studies  in  Section  4,  we  find  that  the 
confidence  bounds  provided  in  Section  3  are  at  least  as 
good  as  the  confidence  bounds  constructed  using  Monte 
Carlo  techniques,  while  the  confidence  bounds  in  Sec¬ 
tion  3  are  much  faster  to  compute.  Our  limited  simula¬ 
tion  studies  also  show  that  the  approximation  based  on 
the  logarithm  transformation  and  t- statistic  works  rea¬ 
sonably  well  when  sample  size  was  as  low  as  19  in  one 
case  and  15  in  another  case. 

From  simulations  and  case  studies  in  Section  4,  it  is 
clear  that  the  confidence  intervals  based  on  single  obser¬ 
vations  (verticals  bars  in  Figures  3,  4,  and  5)  are  wider 
than  the  pointwise  confidence  bounds  based  on  a  ray  de¬ 
sign  which  use  more  data.  In  a  ray  design,  the  constructed 
confidence  bound  used  all  the  information  on  this  ray, 
therefore,  the  estimates  based  on  Section  3  will  be  more 
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A.  Median  Effect  Plots  B.  Inter.  Index  vs.  Effect 


Figure  5.  Median-effect  plots  (Panel  A)  and  the  plot  of  interaction  indices  versus  effects  (Panel  B)  for  the  combination  doses  at  the  fixed  ray 
with  ^  =  CLfk  for  o-phenanthroline  and  ADP.  In  Panel  B,  the  solid  line  is  the  plot  of  the  estimated  interaction  indices  versus  effects,  the  two 
dotted  lines  and  the  two  dashed  lines  are  the  pointwise  confidence  bounds  for  the  curve  of  interaction  index  versus  effect  based  on  the  delta  method 
and  Monte  Carlo  simulation,  respectively,  and  the  dots  and  the  vertical  bars  are  the  estimated  interaction  indices  and  their  confidence  intervals  for 
observed  combinations. 


efficient  and  more  accurate.  By  examining  the  curve  of 
interaction  indices  versus  effects  on  several  rays  and  ex¬ 
amining  their  associated  confidence  bounds,  one  may  ob¬ 
tain  an  overall  picture  of  the  drug  interactions.  The  lim¬ 
itation  is  that  one  can  only  assess  drug  interactions  for 
combination  doses  at  these  fixed  examined  rays.  When 
a  factorial  design  or  a  uniform  design  (Tan,  Fang,  Tian, 
and  Houghton  2003)  is  used,  a  good  strategy  is  to  use  re¬ 
sponse  surface  models,  which  use  all  the  information  pre¬ 
sented  in  the  observed  data.  We  have  proposed  a  gener¬ 
alized  response  surface  model  (Kong  and  Lee  2006)  and 
a  semiparametric  model  (Kong  and  Lee  2007)  to  capture 
drug  interaction  for  all  combination  doses.  However,  re¬ 
sponse  surface  models  for  more  than  three  drugs  are  dif¬ 
ficult  to  construct.  Therefore,  to  assess  drug  interactions 
among  multiple  drugs,  the  directly  calculated  interaction 
index  and  the  plots  of  interaction  indices  versus  effects  at 
several  fixed  rays  are  still  feasible  and  remain  appealing 
methods  to  use.  The  confidence  intervals  we  provided  in 
Sections  2  and  3  are  easy  to  calculate,  and  have  a  de¬ 
sirable  coverage  rate.  Hence,  it  suggests  that  it  is  not 
necessary  to  run  extensive  Monte  Carlo  simulations  for 
obtaining  these  confidence  intervals.  Based  on  the  result 
of  this  article,  Lee  et  al.  (2009)  constructed  the  simulta¬ 
neous  confidence  interval  for  interaction  indices  over  a 
range  of  treatment  effects.  The  simultaneous  confidence 
interval  is  also  easy  to  calculate  but  is  more  conservative. 
The  proposed  confidence  intervals  can  help  us  to  gauge 
the  uncertainties  of  the  interaction  indices  for  combina¬ 


tion  doses  for  two  or  more  drugs  and  can  also  be  used  to 
provide  more  in-depth  assessment  for  drug  interactions. 
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SHORT  COMMUNICATION 

Genomic  profiling  identifies  TITF1  as  a  lineage-specific  oncogene  amplified 
in  lung  cancer 


KA  Kwei1,  YH  Kim1,  L  Girard2,  J  Kao1,  M  Pacyna-Gengelbach3,  K  Salari4,  J  Lee1,  Y-L  Choi1’5, 
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Lung  cancer  is  a  leading  cause  of  cancer  death,  where  the 
amplification  of  oncogenes  contributes  to  tumorigenesis. 
Genomic  profiling  of  128  lung  cancer  cell  lines  and  tumors 
revealed  frequent  focal  DNA  amplification  at  cytoband 
14ql3.3,  a  locus  not  amplified  in  other  tumor  types.  The 
smallest  region  of  recurrent  amplification  spanned  the 
homeobox  transcription  factor  TITF1  (thyroid  transcrip¬ 
tion  factor  1;  also  called  NKX2-1 ),  previously  linked 
to  normal  lung  development  and  function.  When  amplified, 
TITF1  exhibited  increased  expression  at  both  the 
RNA  and  protein  levels.  Small  interfering  RNA  (siRNA)- 
mediated  knockdown  of  TITF1  in  lung  cancer  cell  lines 
with  amplification  led  to  reduced  cell  proliferation, 
manifested  by  both  decreased  cell-cycle  progression  and 
increased  apoptosis.  Our  findings  indicate  that  TITF1 
amplification  and  overexpression  contribute  to  lung  cancer 
cell  proliferation  rates  and  survival  and  implicate  TITF1 
as  a  lineage-specific  oncogene  in  lung  cancer. 

Oncogene  advance  online  publication,  21  January  2008; 
doi:10.1038/sj.onc.l211012 

Keywords:  TITF1;  lineage-specific  oncogene;  genomic 
profiling;  lung  cancer;  TTF-1;  NKX2-1 


Lung  cancer  is  the  leading  cause  of  cancer  death  in  the 
United  States  (Jemal  et  al.,  2007).  In  lung  cancers,  the 
amplification  of  oncogenes  such  as  MYC,  KRAS ,  MET , 
EGFR ,  ERBB2 ,  CCND1  and  CDK4  contributes  to 
tumor  development  and  progression,  and  amplified 
genes  have  become  important  targets  for  molecularly- 
directed  therapies  (Sato  et  al .,  2007).  To  discover  novel 
amplicons,  we  profiled  52  non-small  cell  lung  cancer 
(NSCLC)  cell  lines  and  76  NSCLC  tumors  (36  adeno¬ 
carcinomas  including  2  metastases,  and  40  squamous  cell 
carcinomas  including  1  metastasis),  by  array-based 
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comparative  genomic  hybridization  (aCGH)  on  cDNA 
microarrays  (Pollack  et  al .,  1999)  covering  ~  22  000 
human  genes  with  a  median  inter-probe  distance  of 
^30  kb.  The  most  frequent  focal  DNA  amplification 
not  associated  with  a  previously  known  oncogene 
occurred  at  cytoband  14ql 3.3  (Figure  la). 

Gain  at  14ql 3.3  was  present  in  17  of  52  (33%)  lung 
cancer  cell  lines,  where  it  was  more  often  observed  in  cell 
lines  derived  from  adenocarcinomas  (including  bronch¬ 
ioloalveolar  carcinomas)  compared  to  other  histologies 
(P  =  0.04,  Fisher’s  exact  test;  unspecified  NSCLCs 
excluded  from  the  analysis)  (Figure  lb).  14ql 3.3  gain 
was  also  detectable  in  4  of  36  (11%)  lung  adenocarci¬ 
nomas  and  in  1  of  40  (3%)  squamous  cell  carcinomas 
(all  samples  with  gain  were  primary  tumors).  The  lower 
frequencies  observed  in  patient  tumors  may  reflect 
an  under-calling  of  gains  due  to  contaminating  non- 
cancerous  cells  in  tumor  samples  or  alternatively  to  a 
bias  in  the  tumors  attempted  or  successfully  established 
as  cultures  or  to  selective  pressures  on  cultured  cells. 
Gain  of  14ql3.3  was  significantly  associated  with  the 
presence  of  ifGFR-activating  mutations  (P  =  0.03, 
Mann-Whitney  U-test)  (but  not  KRAS  or  TP53 
mutations),  as  well  as  the  presence  of  specific  DNA 
gains/losses  elsewhere  in  the  genome,  including  gain 
at  5pl 5.33  ( TERT)  (Supplementary  Table  1).  Notably, 
we  have  not  observed  the  14ql 3.3  locus  to  be  amplified 
in  other  tumor  types  that  we  have  profiled  on  the 
same  platform,  including  cancers  (totaling  385  speci¬ 
mens)  of  the  breast,  prostate,  colon  and  pancreas 
(Bashyam  et  al .,  2005;  Bergamaschi  et  al .,  2006; 
Lapointe  et  al .,  2007;  unpublished  data),  suggesting 
that  the  putative  driver  oncogene(s)  within  this  locus  is 
lung  cancer  specific. 

The  smallest  region  of  recurrent  amplification,  corro¬ 
borated  by  CGH  on  a  custom  high-definition  oligonu¬ 
cleotide  microarray  with  probes  spanning  1 4q  1 3 . 2— q  13.3 
at  300  bp  intervals  (Figure  lc),  included  just  eight  named 
genes:  NFKBIA ,  INSM2,  GARNLf  BRMS1L ,  MBIP , 
TITFf  NKX2-8  and  PAX9.  Because  TITF1  (thyroid 
transcription  factor  1;  also  called  TTF-1  and  NKX2-1) 
was  known  to  participate  in  normal  lung  development 
(Kimura  et  al .,  1996)  and  had  been  characterized  as  a 
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Figure  1  TITF1  is  focally  amplified  in  lung  cancer,  (a)  Frequency  plot  of  cytobands  harboring  high-level  DNA  amplification  in 
NSCLC  cell  lines  and  tumors  (Supplementary  Table  3).  Cell  lines  were  obtained  from  the  Hamon  Center  for  Therapeutic  Oncology 
Research,  UT  Southwestern  Medical  Center.  Tumors  were  banked  at  the  University  Hospital  Charite,  Berlin,  Germany,  with  patient 
consent  and  Institutional  Review  Board  approval,  and  DNA  was  extracted  from  several  30  pm  cryostat  tissue  sections  containing 
^70%  tumor  cells.  CGH  was  performed  on  cDNA  microarrays  from  the  Stanford  Functional  Genomics  Facility  containing  39  632 
human  cDNAs  (representing  22279  different  mapped  human  genes/cDNAs),  using  our  published  protocol  (Pollack  et  al .,  2002).  Map 
positions  for  arrayed  cDNA  clones  were  assigned  using  the  NCBI  genome  assembly,  accessed  through  the  UCSC  genome  browser 
database  (NCBI  Build  36)  (Kent  et  al .,  2002).  High-level  DNA  amplification  was  defined  as  tumor/normal  aCGH  ratios  >  3;  selected 
cytobands  with  frequent  amplification  are  indicated.  The  complete  microarray  data  set  is  accessible  from  the  GEO  repository 
(GSE9995).  (b)  Genomic  profiles  by  CGH  on  cDNA  microarrays  for  NSCLC  cell  lines  and  tumors,  histologies  indicated 
(M  =  metastasis),  for  a  segment  of  chromosome  14ql3.1-q21.1.  Genes  are  ordered  by  genome  position.  Red  indicates  positive  tumor/ 
normal  aCGH  ratios  (scale  shown),  and  samples  called  gained,  using  the  fused  lasso  method  (Tibshirani  and  Wang,  2008),  at  14ql 3.3 
are  marked  below  by  closed  circles.  Genes  and  ESTs  (IMAGE  clone  ID  shown)  on  the  microarray  residing  within  the  amplicon  core 
are  highlighted  by  red  text,  (c)  Genomic  profiles  by  CGH  on  an  Agilent  (Santa  Clara,  CA,  USA)  high-definition  custom  microarray 
tiling  14ql3.2-q21.1.  The  arrays  comprised  10614  probes  tiling  3.3Mb  (nt  34457  000-37  750000)  at  14ql 3.2— q21 . 1  with  an  average 
inter-probe  spacing  of  310  nt,  with  an  additional  32451  probes  spanning  the  remaining  genome  for  data  normalization.  DNAs  were 
labeled  as  above,  then  hybridized  to  the  array  following  the  manufacturer’s  instructions.  Arrays  were  scanned  using  an  Agilent  G2505B 
scanner  and  data  extracted  and  normalized  using  Agilent  Feature  Extraction  software  (version  9.1)  with  default  settings.  Shown  are 
two  informative  samples  defining  the  amplicon  boundaries,  mapped  onto  the  UCSC  genome  browser.  The  smallest  region  of  recurrent 
amplification  spans  eight  named  genes.  cDNA,  complementary  DNA;  CGH,  comparative  genomic  hybridization;  NSCLC,  non-small- 
cell  lung  cancer;  UCSC,  University  of  California  Santa  Cruz. 


histological  marker  of  lung  adenocarcinoma  (Travis 
et  al .,  2004),  we  sought  to  explore  a  possible  functional 
connection  of  TITF1  gene  amplification  with  lung 
cancer. 


Consistent  with  an  oncogenic  role,  TITF1  exhibited 
increased  expression  at  both  the  RNA  (P  =  0.046, 
Mann-Whitney  U- test)  (Figure  2b)  and  protein  level 
(Figure  2c)  in  NSCLC  cell  lines  with  amplification. 
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Figure  2  TIT  FI  is  overexpressed  when  amplified  in  lung  cancer  lines,  (a)  FISH  validation  of  TIT  FI  amplification  in  select  NSCLC 
cell  lines.  FISH  was  performed  using  Vysis  (Downers  Grove,  IL,  USA)  reagents  according  to  the  manufacturer’s  protocols.  A  locus- 
specific  BAC  mapping  to  TITF1  at  14ql 3.3  (RP11-1083E2;  BACPAC  Resources  Centre,  Oakland,  CA,  USA)  was  labeled  with 
SpectrumGreen,  and  co-hybridized  with  a  SpectrumOrange-labeled  telomere-14q  probe  (Vysis).  Slides  were  counterstained  with  DAPI 
and  imaged  using  an  Olympus  BX51  fluorescence  microscope  with  Applied  Imaging  (San  Jose,  CA,  USA)  Cytovision  3.0  software. 
DNA  amplification  is  evidenced  by  increased  TITF1  (green)/telomere-14q  (red)  signals  (HCC1195,  right)  or  by  TITF1  signal  clusters 
(HCC1833,  left),  (b)  mRNA  levels  of  TITF1 ,  measured  by  microarray,  are  elevated  in  NSCLC  cell  lines  with  TITF1  amplification  and 
also  in  comparison  to  primary  and  immortalized  (but  non-tumorigenic)  lung  epithelial  cultures  (Ramirez  et  al.,  2004).  Gene  expression 
profiling  was  performed  as  described  (Lapointe  et  al.,  2004).  Reported  fluorescence  ratios  for  TITF1  are  normalized  to  the  average 
TITF1  expression  level  across  all  samples.  Box  plots  show  25th,  50th  and  75th  percentiles  of  expression.  P-values  (Mann-Whitney 
U-test)  are  indicated,  (c)  Western  blot  analysis  of  representative  NSCLC  cell  lines  indicates  that  TITF1  is  overexpressed  at  the  protein 
level  when  amplified.  Electrophoresis  and  blotting  were  performed  as  described  (Kao  and  Pollack,  2006).  TITF1  (~47kDa)  and 
GAPDH  (loading  control)  were  detected  using  anti-TITFl  rabbit  polyclonal  antibody  (1:200;  Santa  Cruz  Biotechnology,  Santa  Cruz, 
CA,  USA)  and  anti-GAPDH  rabbit  polyclonal  antibody  (1:5000;  Santa  Cruz  Biotechnology),  followed  by  HRP-conjugated  anti-rabbit 
IgG  (1:20000;  Pierce,  Rockford,  IL,  USA).  Detection  was  carried  out  using  the  ECL  kit  (GE  Healthcare,  Piscataway,  NJ,  USA). 
FISH,  fluorescence  in  situ  hybridization;  GAPDH,  glyceraldehyde  3-phosphate  dehydrogenase;  NSCLC,  non-small-cell  lung  cancer; 
TITF1,  thyroid  transcription  factor  1. 


Notably,  while  other  genes  at  14ql3.3  also  exhibited 
increased  expression  when  amplified,  TITF1  was  the 
only  well-measured  gene  that  also  exhibited  significantly 
increased  expression  in  tumorigenic  compared  to  non- 
tumorigenic  cell  lines  (Figure  2b  and  Supplementary 
Table  2).  Sequencing  of  the  TIT  FI  open  reading  frame 
(and  splice  sites)  from  four  NSCLC  cell  lines  (HCC1 195, 
HCC1833,  H2009  and  H661)  with  amplification 
revealed  no  DNA  mutations,  indicating  that  amplifica¬ 
tion-driven  overexpression  of  the  wild-type  gene  product 
would  be  of  relevance. 


To  assess  the  functional  significance  of  TITF1 
amplification  and  overexpression,  we  used  RNA  inter¬ 
ference  to  target  TITF1  knockdown  in  two  lung  cancer 
cell  lines,  HCC1833  and  HCC1195,  with  TITF1 
amplification  validated  by  fluorescence  in  situ  hybridiza¬ 
tion  (Figure  2a).  Transfection  of  a  Dharmacon  On- 
TARGETplus  pool  of  four  different  short  interfering 
RNAs  (siRNAs),  designed  and  chemically  modified  to 
minimize  off-target  effects  (Birmingham  et  al.,  2006; 
Jackson  et  al.,  2006),  led  to  decreased  TITF1  protein 
(Figure  3a)  and  decreased  cell  proliferation  compared  to 
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a  negative  control  siRNA  pool  (Figure  3b).  Transfection 
of  individual  siRNAs  from  the  same  pool  showed 
comparable  effects  (data  not  shown).  In  contrast, 
siRNA  transfection  of  a  lung  cancer  cell  line  (HI  155) 
without  TITF1  amplification  but  with  detectable 
expression  did  not  diminish  cell  proliferation,  indicating 
the  functional  importance  of  amplification-driven  over¬ 
expression.  Transfection  of  a  cell  line  (HI 299)  without 
amplification  or  detectable  expression  also  did  not  alter 
cell  proliferation,  supporting  the  specificity  of  TITF1 


targeting  (Figure  3b).  Similar  negative  results  were 
observed  upon  transfection  of  a  non-lung  cancer 
cell  line  (colorectal  cancer  line  SW48;  data  not  shown). 
In  the  lung  cancer  cell  lines  with  14ql 3.3  amplification, 
the  TITF1 -targeted  reduction  in  cell  proliferation  was 
attributable  to  both  decreased  cell-cycle  progression  (as 
evidenced  by  decreased  S-phase  fraction  with  Gi  block; 
Figure  3c)  and  increased  apoptosis  (Figure  3d).  The 
effects  were  more  pronounced  in  HCC1833  compared 
to  HCC1195. 


a  HCC1833  HCC1195  H1 155  H1299 

siRNA  pool:  control  TITF1  control  TITF1  control  TITF1  control  TITF1 
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TITF1  is  a  tissue-specific  transcription  factor  required 
for  branching  morphogenesis  during  normal  lung  devel¬ 
opment,  as  well  as  for  the  differentiation  of  pulmonary 
epithelial  cells,  as  marked  by  the  expression  of  surfactant 
proteins  (which  are  among  its  transcriptional  targets) 
(Bohinski  et  al .,  1994;  Kimura  et  al .,  1996;  Minoo  et  al ., 
1999).  In  the  developing  and  adult  lung,  TITF1  is 
expressed  mainly  in  peripheral  airway  cells  and  small¬ 
sized  bronchioles  (Yatabe  et  al .,  2002).  TITF1  has  also 
been  found  to  be  expressed  in  approximately  40-50%  of 
NSCLCs,  more  frequently  in  adenocarcinomas  compared 
to  squamous  cell  carcinomas  and  with  expression  linked 
to  more  favorable  prognosis  in  some  but  not  all  studies 
(Puglisi  et  al .,  1999;  Tan  et  al .,  2003).  Immuno staining  of 
TITF1  is  used  as  the  major  lineage-specific  marker  to 
distinguish  primary  lung  adenocarcinoma  from  meta¬ 
static  adenocarcinoma  to  the  lung  (Travis  et  al .,  2004). 
Our  findings  indicate  that  TIT  FI  amplification  and 
resultant  overexpression  contribute  to  increased  cell 
proliferation  rates  and  survival  in  lung  cancer  cells,  and 
now  implicate  TITF1  as  a  lung  cancer-specific  oncogene. 

Recently,  Tanaka  et  al  (2007)  reported  that  TITF1 
knockdown  led  to  decreased  colony  formation,  which 
they  attributed  to  increased  apoptosis  in  a  cell  line  (NCI- 
H358)  with  TITF1  expression  but  (in  their  hands)  no 
amplification.  In  our  study,  we  showed  that  TITFl 
amplification  led  to  protein  overexpression  and  sensitivity 
to  siRNA-mediated  knockdown,  highlighting  the  role  of 
TITFl  amplification  as  a  critical  event  in  the  pathogen¬ 
esis  of  a  subset  of  lung  cancers.  Tanaka  et  al  also 
reported  increased  gene  dosage  of  TITFl ,  measured  by 
Southern  blot  and  TaqMan  PCR,  in  2%  of  primary  lung 
adenocarcinomas.  The  higher  proportion  we  observed 
(11%)  may  reflect  a  higher  sensitivity  of  aCGH  (where 
multiple  probes  per  locus  are  considered),  or  differences 
between  the  patient  cohorts.  Importantly,  aCGH  analysis 
also  permitted  us  to  define  the  14ql3  amplicon  structure 
and  boundaries,  where  our  studies  placed  TITFl  squarely 
within  the  amplicon  core,  consistent  with  a  ‘driver’  role. 

Given  its  connection  to  pulmonary  epithelium  differ¬ 
entiation,  an  oncogenic  role  of  TITFl  might  seem 
counterintuitive.  However,  other  tissue-specific  transcrip¬ 
tion  factors  have  been  found  to  be  amplified  in  cancers, 


including  MITF  in  melanoma  (Garraway  et  al ,  2005),  AR 
in  prostate  cancer  (Visakorpi  et  al ,  1995)  and  ESR1  in 
breast  cancer  (Holst  et  al ,  2007).  The  deregulated 
expression  of  such  transcription  factors,  with  normal  roles 
in  lineage  proliferation  or  survival,  may  be  required  for 
tumor  survival  and  progression  in  some  cellular  and 
genetic  contexts,  reflecting  a  state  of  ‘lineage  dependency’ 
(Garraway  and  Sellers,  2006).  More  generally,  the 
deregulated  expression  of  transcription  factors  with  roles 
in  normal  development  reflects  the  principle  of  ‘oncology 
recapitulates  ontogeny’  (Lechner  et  al ,  2001). 

Our  finding  that  14ql3.3  amplification  occurs  mainly 
in  lung  adenocarcinomas  (and  their  derived  cell  lines), 
the  same  histology  in  which  TITFl  expression  (even 
when  not  amplified)  is  predominantly  restricted,  is 
consistent  with  TITFl  being  the  primary  driver  onco¬ 
gene  within  14ql 3.3.  Nonetheless,  other  genes  within  the 
14ql3  amplicon  may  contribute  to  tumorigenesis. 
Notably,  also  residing  within  the  amplicon  core  are 
two  other  homeodomain-containing  genes,  NKX2-8  and 
PAX9 ;  the  former  of  which  has  been  recently  implicated 
in  the  control  of  normal  lung  development  (Tian  et  al , 
2006).  Our  preliminary  data  (not  shown)  indicate  that 
PAX9  is  overexpressed  when  amplified  in  some  NSCLC 
cell  lines,  and  exhibits  positive  immunostaining  in  a 
subset  of  lung  tumors.  In  another  context,  TITFl  and 
the  paired-box  member  PAX8  have  been  shown  to 
cooperatively  activate  the  expression  of  thyroid-specific 
target  genes  (Miccadei  et  al ,  2002).  It  is  tempting  to 
speculate  that  co-amplification  of  NKX2-8  and/or 
PAX9 ,  together  with  TITFl ,  may  contribute  to  lung 
cancer  development  or  progression.  Indeed,  very  re¬ 
cently  Kendall  et  al  (2007),  who  discovered  the  same 
14ql3.3  amplicon,  reported  that  both  NKX2-8  and 
PAX9  can  synergize  with  TITFl  to  promote  prolifera¬ 
tion  of  immortalized  human  lung  epithelial  cells. 

Future  investigations  are  required  to  more  precisely 
define  the  transcriptional  effectors  and  pathways 
through  which  TITFl  mediates  its  oncogenic  function. 
Nonetheless,  our  genomic  profiling  and  functional 
studies  implicate  TITFl  as  a  lineage-specific  oncogene 
in  lung  cancer,  a  discovery  that  may  lead  to  new 
opportunities  for  therapeutic  intervention. 


Figure  3  TITFl  amplification/overexpression  contributes  to  cell  proliferation,  (a)  Confirmation  of  siRNA-mediated  knockdown 
of  target  protein  TITFl  by  western  blot.  On-TARGETplus  siRNAs  targeting  TITFl,  along  with  a  negative  control  siRNA  pool 
(ON-TARGETplus  siCONTROL  Non-targeting  Pool),  were  obtained  from  Dharmacon  (Lafayette,  CO,  USA).  Sequences  of  siRNAs 
are  listed  in  Supplementary  Table  4.  Cell  lines  were  maintained  at  37  °C  in  RPMI-1640  with  10%  fetal  bovine  serum.  For  transfection, 
125  000-200000  cells  were  seeded  per  well  in  a  six-well  plate  and  transfected  using  Lipofectamine  2000  reagent  (Invitrogen,  Carlsbad,  CA, 
USA)  according  to  the  manufacturer’s  protocol,  using  a  final  concentration  of  50  nM  siRNA  for  6  h.  Cell  lysates  were  harvested  72  h  post¬ 
transfection;  GAPDH  served  as  a  loading  control,  (b)  TITFl  knockdown  results  in  decreased  cell  proliferation  in  cells  with  (HCC1833, 
HCC1195)  but  not  without  (HI  155,  H1299)  TITFl  amplification.  At  24,  48,  72  and  96  h  post-transfection,  cell  proliferation  was 
quantified  by  colorimetry  based  on  the  metabolic  cleavage  of  the  tetrazolium  salt  WST-1  in  viable  cells,  according  to  the  manufacturer’s 
protocol  (Roche,  Indianapolis,  IN,  USA).  Transfections  were  performed  in  replicate  and  mean  ( ±  1  s.d.)  OD  reported,  (c)  TITFl 
knockdown  reduces  cell-cycle  progression,  evidenced  by  decreased  S-phase  fraction  with  Gi  block.  At  72  h  post-transfection,  cell-cycle 
distribution  analysis  was  performed  using  the  BrdU-FITC  Flow  kit  (BD  Biosciences,  San  Jose,  CA,  USA)  as  per  the  manufacturer’s 
instructions.  Cells  were  incubated  with  10  pM  BrdU  at  37  °C  for  4h  prior  to  processing  for  analysis.  Anti-BrdU  FITC  and  7- 
aminoactinomycin  D  (for  total  DNA  content)  stainings  were  scored  by  FACSCalibur  (BD  Biosciences)  and  analysed  using  CellQuest 
software  (BD  Biosciences).  Transfections  were  performed  in  triplicate  and  mean  ( ±  1  s.d.)  cell-cycle  fractions  reported.  Representative 
FACS  plots  are  also  shown,  (d)  TITFl  knockdown  leads  to  increased  apoptosis.  At  72  h  post-transfection,  apoptosis  was  assayed  by 
annexin-V  staining  and  quantified  by  flow  cytometry  using  the  Vybrant  Apoptosis  Assay  kit  (Invitrogen)  as  per  the  manufacturer’s 
instructions.  Transfections  were  performed  in  triplicate,  and  mean  ( ±  1  s.d.)  percent  apoptosis  reported.  *F<0.05.  **F<0.01  (Student’s 
t-test;  TITFl  compared  to  control).  BrdU,  bromodeoxyuridine;  siRNA,  small  interfering  RNA;  TITFl ,  thyroid  transcription  factor  1. 
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Note  added  in  proof  .  The  TITFl  amplicon  was  also 
recently  identified  by  Weir  et  al.  (2007). 
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Introduction 


Lung  cancer  cells  have  defects  in  the  regulatory  circuits  that  govern  normal  cell  proliferation  and 
homeostasis.  Hanahan  and  Weinberg  (1)  described  the  “hallmarks  of  cancer”  as  six  essential 
alterations  in  cell  physiology  that  collectively  dictate  malignant  growth.  These  acquired 
capabilities  found  in  lung  cancers  are:  self-sufficiency  of  growth  signals,  insensitivity  to  growth- 
inhibitory  (antigrowth)  signals,  evasion  of  programmed  cell  death  (apoptosis),  limitless 
replicative  potential,  sustained  angiogenesis,  and  tissue  invasion  and  metastasis.  Transformation 
from  a  normal  to  malignant  lung  cancer  phenotype  is  thought  to  arise  in  a  multi-step  fashion, 
through  a  series  of  genetic  and  epigenetic  alterations,  ultimately  evolving  into  invasive  cancer  by 
clonal  expansion  (2).  These  progressive  pathological  changes  in  the  bronchial  epithelium  - 
known  as  preneoplastic  or  premalignant  lesions  -  occur  primarily  as  one  of  three  distinct 
morphological  forms:  squamous  dysplasia,  atypical  adenomatous  hyperplasia,  and  diffuse 
idiopathic  pulmonary  neuroendocrine  cell  hyperplasia  (3).  Bronchial  squamous  dysplasia  and 
carcinoma  in  situ  (CIS)  are  the  recognized  preneoplastic  lesions  for  squamous  cell  carcinoma; 
atypical  adenomatous  hyperplasia  (AAH),  a  putative  preneoplastic  lesion,  for  a  subset  of 
adenocarcinomas;  and  neuroendocrine  cell  hyperplasia  for  neuroendocrine  lung  carcinomas  (3). 
These  preneoplastic  lesions  however  account  for  the  development  of  only  a  subset  of  lung 
cancers,  for  example,  the  precursor  lesion  for  the  most  common  neuroendocrine  carcinoma  of  the 
lung,  small  cell  lung  carcinoma  (SCLC),  is  unknown.  Tumors  are  believed  to  become 
increasingly  malignant  with  time,  initiating  tumorigenesis  from  possibly  only  a  handful  of 
mutations  followed  by  additional  (and  different)  mutations  and  epigenetic  changes  acquired 
during  clonal  expansion,  where  cells  possessing  in  vivo  growth  advantage  become  dominant  (2). 
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The  identification  and  characterization  of  these  molecular  changes  in  lung  cancer  is  of 
fundamental  importance  for  improving  the  prevention,  early  detection,  treatment  and  palliation 
of  this  disease.  The  overall  goal  is  to  translate  these  findings  to  the  clinic  by  using  molecular 
alterations  as:  1)  biomarkers  for  early  detection;  2)  targets  for  prevention;  3)  tools  for  new 
molecular  approaches;  4)  signatures  for  personalizing  prognosis  and  therapy  selection  for  each 
patient,  and  5)  targets  to  specifically  kill  or  inhibit  the  growth  of  lung  cancer  in  patients. 

Lung  cancer  arises  from  neoplastic  changes  to  epithelial  cells  in  the  lung.  However,  it  is  not 
known  whether  all  lung  epithelial  cells  are  susceptible  to  malignant  transformation  or  only  a 
subset  of  these  cells,  specifically;  a  major  question  is  whether  the  changes  need  to  take  place  in 
lung  epithelial  cells  with  stem  cell-like  properties.  Lung  cancer  is  a  heterogeneous  disease 
clinically,  biologically,  histologically  and  molecularly.  The  underlying  causes  of  this 
heterogeneity  are  unknown  and  could  reflect  changes  occurring  in  cells  with  various  potential  for 
differentiation  (e.g.  squamous  or  adenomatous)  or  represent  different  molecular  changes 
occurring  in  the  same  target  lung  epithelial  cells.  This  heterogeneity  and  molecular  complexity 
contributes  to  the  difficulty  in  unraveling  the  pathogenesis  of  lung  cancer.  Multiple  oncogenes, 
tumor  suppressor  genes  (TSGs),  signaling  pathway  components  and  other  cellular  processes  are 
involved  in  the  molecular  pathogenesis  of  lung  cancer.  This  chapter  will  review  molecular 
aberrations  in  lung  cancer  and  the  multiple  pathways  through  which  it  develops. 

The  two  main  disease  categories  of  lung  cancer,  non-small  cell  lung  cancer  (NSCLC) 
(representing  80-85%  of  cases)  and  small  cell  lung  cancer  (SCLC)  (representing  15-20%)  are 
generally  classified  based  on  differences  in  histological,  clinical  and  neuroendocrine 
characteristics.  NSCLC  and  SCLC  can  also  differ  molecularly  with  many  genetic  alterations 
exhibiting  subtype  specificity  (summarized  in  Table  1).  Additionally,  molecular  studies  of 
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NSCLC  have  also  revealed  considerable  differences  between  the  subtypes  of  NSCLC, 
particularly  the  two  most  common  subtypes:  adenocarcinoma  and  squamous  cell  carcinoma. 

Epidemiology  and  susceptibility  in  lung  cancer 

Eighty-five  percent  of  lung  cancers  are  caused  by  tobacco  smoke  where  exposure  to  carcinogens 
present  in  tobacco  smoke  leads  to  the  acquisition  of  genetic  mutations  that  may  eventually 
initiate  carcinogenesis.  However,  not  all  lung  cancers  arise  in  smokers,  and  not  all  smokers  will 
develop  lung  cancer.  Thus,  inherited  factors  must  be  involved  which  may  predispose  an 
individual  to  develop  lung  cancer  -  either  by  increasing  susceptibility  to  the  damaging  effects  of 
carcinogen  exposure,  or  by  increasing  susceptibility  regardless  of  smoking  history.  Worldwide, 
approximately  25%  of  lung  cancer  cases  are  not  attributable  to  smoking  (4).  These  cases  occur 
more  frequently  in  women,  especially  in  Asiatic  countries,  target  the  distal  airways,  and  are 
commonly  adenocarcinomas.  Coupled  with  molecular  data  that  indicates  strikingly  different 
mutation  patterns  between  known  lung  cancer  genes  such  as  KRAS,  EGFR,  and  TP53  and 
clinical  data  in  relation  to  response  to  targeted  therapies  -  it  has  now  been  suggested  that  lung 
cancer  in  never  smokers  be  considered  a  distinct  disease  from  the  more  common  tobacco  smoke- 
related  lung  cancer  (4). 

Many  studies  have  examined  the  effect  of  single  nucleotide  polymorphisms  (SNPs)  on  the  risk  of 
developing  lung  cancer  (reviewed  in  (5,  6)).  The  reported  risk  effect  in  these  studies  is  generally 
modest  and  often  inconsistent,  explaining  why  none  are  in  routine  use.  However,  meta-analyses 
as  well  as  use  of  whole-genome  SNP  microarrays  may  hold  the  key  to  identifying  robust  and 
possible  synergistic  interactions  between  the  modest  affect  of  multiple  SNPs.  Of  note,  lung 
cancer  risk  was  recently  associated  with  genomic  variation  at  15q24/q25.1  by  three  separate 
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studies  simultaneously  that  used  whole-genome  SNP  microarrays  (7-9).  While  the  conclusions 
of  the  three  studies  differed  in  whether  the  risk  is  conferred  directly  with  cancer  or  through 
nicotine  addiction,  the  genes  within  this  locus  -  which  include  several  genes  encoding  nicotinic 
acetylcholine  receptor  subunits  (nAChR)  -  represent  important  targets  for  further  functional 
analyses. 

Genomic  instability,  telomeres  and  DNA  damage  in  lung  cancer 

Malignant  transformation  is  characterized  by  genetic  instability  which  can  exist  at  the 
chromosomal  level  (with  loss  or  gain  of  genomic  material,  translocations,  and  microsatellite 
instability),  at  the  nucleotide  level  (with  single  or  several  nucleotide  base  changes),  or  in  the 
transcriptome  (with  altered  gene  expression).  Abnormalities  are  typically  targeted  to  proto¬ 
oncogenes,  TSGs,  DNA  repair  genes  and  other  genes  that  can  promote  outgrowth  of  affected 
cells  (10).  The  erosion  of  telomeres  at  the  end  of  chromosomes  is  also  associated  with  genomic 
instability  leading  to  chromosomal  abnormalities.  Telomere  length  regulates  the  replicative 
capacity  of  a  cell,  where  progressive  telomere  shortening  occurs  with  each  replication.  Once  the 
telomere  becomes  too  short  the  cell  will  undergo  cellular  senescence  or  apoptosis.  Activation  of 
telomerase,  the  telomere-lengthening  enzyme,  in  premalignant  cells  prevents  loss  of  telomere 
ends  beyond  critical  points  and  is  essential  for  cell  immortality.  Although  silenced  in  normal 
cells,  telomerase  is  activated  in  >80%  of  NSCLCs  and  almost  uniformly  in  SCLCs  (11-13).  In 
normal  cells  the  presence  of  DNA  damage  engenders  a  DNA  repair  response,  and  if  this  is  not 
successful,  the  apoptosis  program  is  activated  to  remove  the  damaged  cell.  However,  in 
premalignant  and  cancer  cells  the  apoptosis  program  is  often  itself  damaged  thus  allowing  un-  or 
mis-repaired  DNA  damage  to  persist  in  clones  of  cells. 
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Oncogenes  and  growth  stimulatory  pathways 

Many  oncogenes  and  TSGs  have  been  identified  by  mapping  of  copy-number  changes 
throughout  the  cancer  genome  (14-23).  Earlier  genomic  analysis  technology  such  as  karyotyping 
and  comparative  genomic  hybridization  (CGH)  enabled  low  resolution  characterization  of  the 
lung  cancer  genome  identifying  whole  arm/large-scale  gain  or  loss  on  nearly  every  chromosomal 
arm,  but  most  commonly  3p,  4q,  9p,  and  17p  loss  and  lq,  3q,  5p,  and  17q  gain  (24,  25)  (Table 
1).  However,  high  resolution  microarray  analyses  can  now  narrow  in  on  these  aberrant  regions 
to  detect  focal  amplifications  and  deletions  often  spanning  only  a  handful  of  genes  (Table  1). 
Oncogenic  activation  typically  occurs  by  gene  amplification,  point  mutation,  rearrangement,  or 
through  gene  over-expression  by  other  mechanisms  including  those  mediated  by  miRNAs. 
These  changes  can  result  in  persistent  upregulation  of  mitogenic  growth  signals  which  induce 
cell  growth.  While  promoting  the  malignant  transformation  of  a  cell,  persistent  upregulation  of  a 
particular  growth  signal  or  pathway  can  also  result  in  ‘oncogenic  addiction’  -  whereby  the  cell 
becomes  dependent  upon  this  aberrant  oncogenic  signaling  for  survival  (26).  This  presents  an 
obvious  target  for  therapeutics;  remove  or  inhibit  the  oncogenic  signal  and  an  addicted  tumor  cell 
will  die  while  normal  ‘non-addicted’  cells  will  be  unaffected.  Signaling  pathways  commonly 
involved  in  lung  cancer  are  shown  in  Figure  1. 

Epidermal  growth  factor  receptor  signaling 

The  ErbB  family  of  tyrosine  kinase  receptors  includes  four  members  -  EGFR,  ErbB-2  (HER2), 
ErbB-3,  and  ErbB-4  (27).  While  the  intracellular  tyrosine  kinase  domains  of  the  four  receptors 
are  highly  conserved  the  extracellular  domain  is  not  so  conserved,  enabling  the  receptors  to  bind 
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different  ligands.  Following  ligand  binding,  the  ErbB  receptors  form  homo-  or  heterodimers 
which  results  in  receptor  activation  and  subsequent  activation  of  various  signaling  pathways. 
Activation  of  EGFR  through  the  binding  of  EGF  and  EGF-like  binding  growth  factors  such  as 
Transforming  Growth  Factor-a  (TGF-a)  enables  the  regulation  of  epithelial  cell  behavior  and  the 
initiation  of  tumors  from  epithelial  cell  origin  through  multiple  signaling  pathways.  These 
include  the  RAS/RAF/MEK/MAPK  pathway  (cell  proliferation),  and  the  PI3K/AKT  pathway 
and  signal  transduction  and  activator  of  transcription  (STAT)  3  and  STAT5  pathways  (cell 
survival  through  the  evasion  of  apoptosis)  (28)  (Figure  1).  EGFR  exhibits  over-expression  or 
aberrant  activation  in  approximately  50-90%  of  NSCLCs  with  activating  mutations  occurring 
with  or  without  amplification  (29-32).  Activating  mutations,  which  are  found  with  increased 
frequency  in  certain  subsets  of  lung  cancer  patients,  occur  as  three  different  types  of  somatic 
mutations  -  deletions,  insertions,  and  missense  point  mutations  -  and  are  located  in  exons  19-21 
which  code  for  the  tyrosine  kinase  domain  of  EGFR  (17,  19).  Mutant  EGFRs  (either  by  exon  19 
deletion  or  exon  21  L858R  mutation)  show  an  increased  amount  and  duration  of  EGFR 
activation  compared  with  wildtype  receptors  (17),  and  have  preferential  activation  of  the 
PI3K/AKT  and  STAT3/STAT5  pathways  rather  than  the  RAS/RAF/MEK/MAPK  pathway  (33). 
EGFR  mutant  tumors  (primarily  adenocarcinomas)  are  initially  highly  sensitive  to  EGFR 
tyrosine  kinase  inhibitors  (TKIs)  (17-19).  This  represents  an  example  of  oncogene  addiction  in 
lung  cancer  where  tumors  initiated  through  EGFR  mutation-activation  of  EGF  signaling  rely  on 
continued  EGF  signaling  for  survival.  However,  despite  an  initial  response,  patients  treated  with 
EGFR  TKIs  eventually  develop  resistance  to  TKIs  which  is  linked  (in  approximately  50% 
tumors)  to  the  acquiring  of  a  second  mutation  at  T790M  in  exon  20  (34-39).  Interestingly,  the 
presence  of  the  T790M  mutation  in  a  primary  lung  cancer  that  had  not  been  treated  with  EGFR- 
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TKIs  however,  suggests  that  this  resistance  mutation  may  develop  with  tumor  progression  and 
not  necessarily  as  a  response  to  treatment  (40).  Recently,  amplification  of  the  MET  proto¬ 
oncogene  has  been  associated  with  acquired  resistance  to  EGFR  TKIs  in  20%  of  resistant  cases 
(36,  41)  with  MET  activating  the  PI3K  pathway  through  phosphorylation  of  ERBB3, 
independent  of  EGFR  and  ERBB2  (41).  Importantly,  inhibition  of  MET  signaling  was  able  to 
restore  sensitivity  to  TKIs. 

The  RAS/RAF/MEK/MAPK/MYC pathway 

The  RAS  proto-oncogene  family  (KRAS,  HRAS,  NRAS  and  RRAS)  encode  four  highly 
homologous  21kDa  membrane-bound  proteins  involved  in  signal  transduction.  Proteins  encoded 
by  the  RAS  genes  exist  in  two  states:  an  active  state,  in  which  GTP  is  bound  to  the  molecule  and 
an  inactive  state,  where  the  GTP  has  been  cleaved  to  GDP  (42).  Activating  point  mutations  can 
confer  oncogenic  potential  through  a  loss  of  intrinsic  GTPase  activity  resulting  in  an  inability  to 
cleave  GTP  to  GDP.  This  can  initiate  unchecked  cell  proliferation  through  the 
RAS/RAF/MEK/MAPK  pathway,  downstream  of  the  EGFR  signaling  pathway  (43).  Activating 
RAS  mutations  occur  in  approximately  15-20%  of  NSCLCs  and,  in  particular,  30-50%  of 
adenocarcinomas  (44).  In  lung  cancer,  90%  of  mutations  are  located  in  KRAS  (80%  in  codon  12, 
and  the  remainder  in  codons  13  and  61)  with  HRAS  and  NRAS  mutations  only  occasionally 
documented  (44).  KRAS  mutations  are  mutually  exclusive  with  EGFR  and  ERBB2  mutations, 
and  confer  resistance  to  EGFR  TKIs  and  chemotherapy  (45-47).  Additionally,  while  KRAS 
mutations  are  primarily  observed  in  lung  adenocarcinomas  of  smokers,  EGFR  mutations  are 
primarily  observed  in  lung  adenocarcinomas  of  never-smokers  (4).  These  data  demonstrate  how 
lung  adenocarcinoma  can  develop  through  different  pathways,  and  it  is  likely,  given  the 
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importance  of  EGFR  targeted  therapy  that  determination  of  EGFR  and  KRAS  mutations  in 
tumors  will  soon  become  part  of  standard  care. 

BRAF  mutations  occur  in  1-5%  of  lung  cancers  and  mutant  BRAF  mouse  models  can  develop 
lung  adenocarcinomas  (48).  The  MYC  proto-oncogene  members  are  targets  of  RAS  signaling 
and  key  regulators  of  numerous  downstream  pathways  such  as  cell  proliferation  (49).  Activation 
of  MYC  members  often  occurs  through  gene  amplification.  MYC  is  most  frequently  activated  in 
NSCLC  (50),  with  the  other  two  members,  MYCN  and  MYCL  along  with  MYC,  usually 
activated  in  SCLC  (24,  51). 

The  PI3K/AKT pathway 

The  PI3K/AKT  pathway  which  lies  downstream  of  several  receptor  tyrosine  kinases  (RTKs) 
(such  as  EGFR)  is  a  key  regulator  of  cell  proliferation,  cell  growth,  and  cell  survival  and  is 
commonly  activated  in  lung  cancer  through  changes  in  several  of  its  componenets  including 
PI3K,  PTEN,  AKT,  or  EGFR  or  KRAS.  In  lung  tumorigenesis,  activation  of  the  PI3K/AKT 
pathway  is  thought  to  occur  early  (52)  and  results  in  cell  survival  through  inhibition  of  apoptosis. 
Activation  can  occur  through  the  binding  of  the  SH2-domains  of  p85,  the  regulatory  subunit  of 
PI3K,  to  phosphotyrosine  residues  of  activated  RTKs  (53).  Alternatively,  activation  can  occur 
via  binding  of  PI3K  to  activated  RAS.  Mutation  and  more  commonly,  amplification  of  PIK3CA, 
which  encodes  the  catalytic  subunit  of  phosphatidylinositol  3-kinase  (PI3K),  occurs  most 
commonly  in  squamous  cell  carcinomas  (20,  54-56).  AKT,  a  serine/threonine  kinase  that  acts 
downstream  from  PI3K  can  also  have  mutations  that  lead  to  pathway  activation.  One  of  the 
primary  effectors  of  AKT  is  mTOR,  a  serine/threonine  kinase  involved  in  regulating 
proliferation,  cell  cycle  progression,  mRNA  translation,  cytoskeletal  organization,  and  survival 
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(57).  The  tumor  suppressor  PTEN,  which  negatively  regulates  the  PI3K/AKT  pathway  via 
phosphatase  activity  on  phosphatidylinositol  3,4,5-trisphosphate  (PIP3),  a  product  of  PI3K  (58) 
is  commonly  suppressed  in  lung  cancer  by  inactivating  mutations  or  loss  of  expression  (59,  60). 

NKX2-1  (TITF1)  -  a  lung  cancer  lineage  dependent  oncogene 

Genome  wide  screens  for  DNA  copy  number  changes  in  primary  NSCLCs  found  multiple 
examples  of  amplification  at  14ql3.3  -  and  subsequent  functional  analysis  (siRNA  knockdowns 
in  NSCLCs)  identified  NKX2-1  (also  termed  TITF1 )  as  the  most  likely  target  of  amplification  in 
lung  cancer  (14,  15,  61).  NKX2-1  encodes  a  lineage-specific  transcription  factor  essential  for 
branching  morphogenesis  in  lung  development  and  the  formation  of  type  II  pneumocytes,  the 
cells  lining  lung  alveoli  (62,  63).  Amplification  of  tissue-specific  transcription  factors  in  cancer 
has  been  observed  in  AR  in  prostate  cancer  (64),  MITF  in  melanoma  (65),  and  ESR1  in  breast 
cancer  (66).  These  findings  have  led  to  the  development  of  a  ‘lineage-dependency’  concept  in 
tumors  (67)  whose  survival  and  progression  of  a  tumor  is  dependent  upon  continued  signaling 
through  a  specific  lineage  pathways  (i.e.  abnormal  expression  of  pathways  involved  in  normal 
cell  development)  rather  than  continued  signaling  through  the  pathway  of  oncogenic 
transformation  as  seen  with  oncogene  addiction  (26). 

EML4-ALK  fusion  proteins 

Oncogenic  fusion  proteins  created  by  recurrent  chromosomal  translocations  are  generally  not 
common  in  solid  tumors  such  as  lung  cancer;  however,  recent  studies  indicate  this  infrequency 
may  be  attributable  to  the  difficulties  in  detection.  The  fusion  of  PTK  echinoderm  microtubule- 
associated  protein  like-4  (EML4)-anaplastic  lymphoma  kinase  (ALK)  was  recently  associated 
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with  lung  cancer  (68)  and  occurs  in  approximately  7%  of  NSCLCs  (68-70).  Fusing  with  EML4 
induces  a  significant  transforming  potential  in  ALK.  While  wildtype  ALK  is  thought  to  undergo 
transient  homodimerization  in  response  with  specific  ligand  binding,  EML4-ALK  is 
constitutively  oligomerized  resulting  in  persistent  mitogenic  signaling  and  ultimately  malignant 
transformation  (71).  Additionally,  EML4-ALK  generally  appears  to  be  mutually  exclusive  to 
that  of  EGFR  or  KRAS  mutations  in  NSCLC  and  is  more  common  in  never  or  former  smokers 
(72). 

Tumor  suppressor  genes  (TSGs)  and  growth  inhibitory  pathways 

Loss  of  TSG  function  is  an  important  step  in  lung  carcinogenesis  process  and  usually  both  alleles 
need  to  be  inactivated.  Generally,  loss  of  heterozygosity  (LOH)  inactivates  one  allele  through 
chromosomal  deletion  or  translocation,  and  point  mutation,  epigenetic  or  transcriptional 
silencing  inactivates  the  second  allele  (73,  74).  In  lung  cancer,  commonly  inactivated  TSGs 
include  TP 53,  RBI,  CDKN2A,  FHIT,  RASSF1A  and  PTEN. 

The  p53  pathway 

TP 5 3  (17pl3)  encodes  a  phosphoprotein  which  prevents  accumulation  of  genetic  damage  in 
daughter  cells.  In  response  to  cellular  stress,  p53  induces  the  expression  of  downstream  genes 
such  as  cyclin-dependent  kinase  (CDK)  inhibitors  which  regulate  cell  cycle  checkpoint  signals, 
causing  the  cell  to  undergo  G1  arrest  and  allowing  DNA  repair  or  apoptosis  (74).  p53 

inactivating  mutations  are  the  most  common  alterations  in  cancer,  especially  lung  cancer,  where 
17pl3  frequently  demonstrates  hemizygous  deletion  and  mutational  inactivation  in  the  remaining 
allele  (75-77).  Regulation  of  p53  can  occur  through  the  oncogene  MDM2,  which  reduces  p53 
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levels  through  degradation,  and  the  pl4ARF  isoform  of  CDKN2A,  which  acts  as  a  tumor 
suppressor  by  inhibiting  MDM2.  As  such,  the  genes  that  encode  MDM2  and  pl4ARF  are  altered 
in  lung  cancer  with  amplification  of  MDM2  seen  in  6%  of  NSCLCs  (78)  and  loss  of  pl4ARF 
expression  in  approximately  40%  and  65%  of  NSCLCs  and  SCLCs,  respectively  (79,  80). 
Restoration  of  p53  expression  in  vivo  has  been  achieved  with  p53  gene  therapy  of  lung  cancer 
patients  in  a  subpopulation  of  tumor  cells  (81). 

The  CDKN2A/RB  pathway 

The  CDKN2A-RB1  pathway  controls  G1  to  S  phase  cell  cycle  progression. 
Hypophosphorylated  retinoblastoma  (RB)  protein,  encoded  by  RBI,  halts  the  Gl/S  phase 
transition  by  binding  to  the  transcription  factor  E2F1.  This  tumor  suppressing  effect  can  be 
inhibited  by  hyperphosphorylation  of  RB  by  CDK-CCND1  complexes  (complexes  between 
CDK4  or  CDK6  and  CCND1),  and  in  turn,  formation  of  CDK-CCND1  complexes  can  be 
inhibited  by  CDNK2A  (82).  Nearly  all  constituents  of  the  CDKN2A/RB  pathway  have  been 
shown  to  be  altered  in  lung  cancer  through  mutations  ( CDK4  and  CDKN2A),  deletions  (RBI  and 
CDKN2A ),  amplifications  ( CDK4  and  CCDN1 ),  methylation  silencing  ( CDKN2A  and  RBI),  and 
phosphorylation  (RB)  (83-88). 

Chromosome  3p  TSGs 

Loss  of  one  copy  of  chromosome  3p  is  one  of  the  most  frequent  and  early  events  in  human 
cancer,  found  in  96%  of  lung  tumors  and  78%  of  lung  preneoplastic  lesions  (89).  Mapping  of 
this  loss  identified  several  genes  with  functional  tumor  suppressing  capacity  including  FHIT 
(3pl4.2),  RASSF1A,  TUSC2  (also  called  FUS1),  and  semaphorin  family  members  SEMA3B  and 
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SEMA3F  (all  at  3p21.3),  and  RARfi  (3p24).  In  addition  to  LOH  or  allele  loss,  some  of  these  3p 
genes  ( FHIT RASSF1A,  SEMA3B  and  RAR/3)  often  exhibit  decreased  expression  in  lung  cancer 
cells  by  means  of  epigenetic  mechanisms  such  as  promoter  hypermethylation  (90-94). 
Additionally,  FHIT,  RASSF1A,  TUSC2,  and  SEMA3B  will  reduce  growth  when  re-introduced 
into  lung  cancer  cells.  FHIT,  located  in  the  most  common  fragile  site  in  the  human  genome 
( FRA3B ),  has  been  shown  to  induce  apoptosis  in  lung  cancer  (95).  RASSF1A  can  induce 
apoptosis,  as  well  as  stabilize  microtubules,  and  affect  cell  cycle  regulation  (96).  The  tumor 
suppressing  effect  of  TUSC2  is  thought  to  occur  via  through  inhibition  of  protein  tyrosine 
kinases  such  as  EGFR,  PDGFR,  c-Abl,  c-Kit,  and  AKT  (97)  as  well  as  inhibition  of  MDM2- 
mediated  degradation  of  p53  (98).  The  candidate  TSG  SEMA3B  encodes  a  secreted  protein 
which  can  decrease  cell  proliferation  and  induce  apoptosis  when  re-expressed  in  lung,  breast  and 
ovarian  cancer  cells  (90,  91,  99,  100)  in  part,  by  inhibiting  the  AKT  pathway  (101).  Another 
family  member,  SEMA3F  may  inhibit  vascularization  and  tumorigenesis  by  acting  on  VEGF  and 
ERK1/2  activation  (102,  103)  and  RAR/3  exerts  its  tumor  suppressing  function  by  binding 
retinoic  acid,  thereby  limiting  cell  growth  and  differentiation. 

LKB1 

The  serine/threonine  kinase  LKB1  (also  called  STK11)  is  inactivated  in  -30%  of  lung  cancers 
and  often  correlates  with  KRAS  activation  (104),  resulting  in  the  promotion  of  cell  growth.  It 
functions  as  a  TSG  by  regulating  cell  polarity,  differentiation,  and  metastasis  and  can  regulate 
cell  metabolism  (105).  It  has  also  been  reported  to  inhibit  the  mTOR  pathway  (106). 

Epigenetic  regulation 
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Genetic  abnormalities  are  associated  with  changes  in  DNA  sequence  however  epigenetic  events 
may  lead  to  changes  in  gene  expression  without  any  changes  in  DNA  sequence  and  therefore, 
importantly,  the  latter  are  potentially  reversible  (107).  Aberrant  promoter  hypermethylation  is  an 
epigenetic  change  that  occurs  early  in  lung  tumorigenesis  and  is  found  both  in  genes  that 
normally  undergo  methylation  in  response  to  ageing,  as  well  as  in  genes  that  normally  remain 
unmethylated  regardless  of  age  (108).  Gains  of  DNA  methylation  in  a  normally  unmethylated 
promoter  region  of  a  gene  results  in  silencing  of  gene  transcription  and  is  therefore  a  common 
method  for  the  inactivation  of  tumor  suppressor  genes.  In  lung  cancer,  many  genes  have  been 
found  to  be  silenced  by  promoter  hypermethylation  (summarized  in  Table  1).  They  include 
genes  involved  in  tumor  suppression,  tissue  invasion,  DNA  repair,  detoxification  of  tobacco 
carcinogens,  and  differentiation.  Recent  advances  in  whole-genome  microarray  profiling  have 
allowed  researchers  to  globally  study  DNA  methylation  patterns  in  lung  cancer,  the  results  of 
which  have  led  to  suggestions  that  the  role  of  methylation  in  lung  tumorigenesis  has  been 
underestimated  (109-112).  Restoration  of  expression  of  epigenetically  silenced  genes  is  a  new 
targeted  therapeutic  approach.  Histone  deacetylation  is  an  example  of  epigenetic  change  that  can 
inhibit  gene  expression.  Histone  deacetylase  inhibitors  (HDACs)  are  being  studied  for  the 
treatment  of  lung  cancer  and  function  by  reversing  gene  silencing  by  inhibiting  histone 
deacetylation  (Figure  1  and  Table  2). 

Micro RN A  -mediated  regulation  of  lung  cancer 

MicroRNAs  (miRNAs)  are  a  recently  identified  class  of  non-protein  encoding  small  RNAs 
present  in  the  genomes  of  plants  and  animals.  Ranging  in  size  from  20-25  nucleotides,  miRNAs 
are  small  RNA  molecules  that  are  capable  of  regulating  gene  expression  by  either  direct  cleavage 
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of  a  targeted  mRNA  or  inhibiting  translation  by  interacting  with  the  3’  untranslated  region 
(UTR)  of  a  target  mRNA.  They  are  considered  to  play  an  important  role  in  the  pathogenesis  of 
cancer  -  as  either  oncogenes  or  TSGs  -  due  to  abnormal  expression  found  in  several  types  of 
cancer,  including  lung  cancer  (113-121).  Additionally,  more  than  50%  of  miRNAs  are  located  in 
cancer-associated  genomic  regions  or  fragile  sites  (122,  123). 

As  observed  for  analyses  on  mRNA,  protein  and  methylation  patterns  in  lung  cancer,  miRNA 
microarrays  have  enabled  the  identification  of  many  lung  cancer-associated  miRNAs  (120,  121, 
124-132).  One  of  the  most  widely-studied  miRNAs  in  lung  cancer  is  the  let-7  miRNA  family. 
Functioning  as  a  tumor  suppressor,  it  has  been  shown  to  regulate  N-RAS,  K-RAS  and  HMGA2 
(133,  134)  via  binding  to  the  let-7  binding  sites  in  their  respective  3’  UTRs  (133,  135).  It  is 
frequently  under-expressed  in  lung  tumors,  particularly  NSCLC,  compared  to  normal  lung,  and 
decreased  expression  has  also  been  associated  with  poor  prognosis  (120,  125).  Induction  of  let-7 
miRNA  expression  has  been  found  to  inhibit  growth  in  vitro  (120,  134,  136,  137)  and  reduce 
tumor  development  in  a  murine  model  of  lung  cancer  (137,  138).  In  addition  to  let-1,  other 
miRNAs  with  suggested  tumor  suppressing  effects  in  lung  cancer  include  miR-126,  miR-29a/b/c, 
miR-1  (125-128),  and  recently,  miR-128b  was  reported  to  be  a  direct  regulator  of  EGFR  with 
frequent  LOH  occurring  in  NSCLC  cell  lines  (129).  Oncogenic  miRNAs  found  to  be  over¬ 
expressed  in  lung  cancer  include  the  miR-1 7-92  cluster  of  seven  miRNAs  (with  suggested  targets 
that  include  PTEN  and  RB),  miR-205,  miR-21,  and  miR-1 55  (121,  130). 

Lung  cancer  stem  cells  and  Hedgehog,  Notch  and  Wnt  Signaling 

The  Hedgehog  (HH),  Wnt  and  Notch  signaling  pathways  are  important  in  normal  lung 
development  -  specifically  progenitor  cell  development  and  pulmonary  organogenesis  - 
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yhowever,  they  are  now  also  being  studied  in  regards  to  their  role  in  tumor  development  (Figure 
2).  These  signaling  pathways  are  thought  to  be  involved  in  the  regulation  on  stem/progenitor  cell 
self-renewal  and  maintenance,  and  while  this  process  is  normally  tightly  regulated  process,  genes 
that  comprise  these  pathways  are  often  mutated  in  human  cancers  (139-141),  leading  to  abnormal 
activation  of  downstream  effectors.  In  relation  to  cancer  treatment,  cancer  stem  cells  are  of  great 
importance  because  they  are  thought  to  be  resistant  to  cytotoxic  therapies.  If  correct,  this 
presents  a  need  for  effective  therapies  against  these  self-renewal  signaling  pathways. 

In  the  HH  pathway,  increased  signaling  results  in  activation  of  the  GLI  oncogenes  (GLI1,  GLI2, 
and  GLI3)  that  can  regulate  gene  transcription  (142-144).  The  HH  signaling  pathway  was 
originally  shown  to  have  persistent  activation  in  SCLC  with  high  expression  of  SHH,  PTCH,  and 
GLI1  (145)  but  an  important  role  in  NSCLC  was  also  recently  demonstrated  (146).  The  Notch 
signaling  pathway  is  important  in  cell  fate  determination  but  can  also  promote  and  maintain 
survival  in  many  human  cancers  (147-150).  A  recent  study  in  mammary  stem  cells  suggests  the 
cytokine  IL-6  may  function  as  a  regulator  of  self  renewal  in  normal  and  tumor  mammary  stem 
cells  through  the  Notch  pathway  through  upregulation  of  the  Notch-3  receptor  (151),  which  is 
expressed  in  ~40%  of  resected  lung  cancers  (152).  The  multifunctional  cytokine  IL-6  is 
involved  in  activation  of  JAK  family  of  tyrosine  kinases  (153),  which  in  turn  activate  multiple 
pathways  through  signaling  molecules  such  as  STAT3,  MAPK,  and  PI3K  (154).  In  lung 
adenocarcinomas,  activated  mutant  EGFR  has  also  been  shown  to  induce  levels  of  IL-6  leading 
to  activation  of  STAT3  (155).  The  Wnt  pathway  has  critical  roles  in  organogenesis,  cancer 
initiation  and  progression,  and  maintenance  of  stem  cell  pluripotency.  In  NSCLC,  studies  have 
found  dysregulation  of  Wnt  pathway  members  such  as  Wntl,  Wnt2  and  Wnt7a,  as  well  as 
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upregulation  of  Wnt  pathway  agonists  (Dvl  proteins,  LEF1,  and  Ruvbl  1)  and  underexpression  or 
silencing  of  antagonists  (WIF-1,  sFRPl,  CTNNBIP1,  and  WISP2)  (156-162). 

Human  papilloma  virus-mediated  lung  cancer 

Human  papilloma  virus  (HPY)  has  been  identified  in  tumors  from  many  organs,  not  just 
gynecological  tumors.  Nearly  thirty  years  ago  it  was  suggested  to  be  a  risk  factor  for  lung 
cancer,  particularly  squamous  cell  carcinoma  (163)  and  since  then,  many  studies  have 
investigated  the  role  of  HPV  in  lung  cancer  and  have  reported  considerable  geographical 
variation.  A  recent  meta-analysis  of  53  publications  comprising  4,508  cases  found  the  mean 
incidence  of  HPV  in  lung  cancer  was  25%  and  was  detected  in  all  subtypes  of  lung  cancer,  not 
just  squamous  cell  (164).  Studies  from  Europe  and  America  had  a  lower  incidence  of  15-17% 
while  Asian  lung  cancer  cases  reported  a  mean  incidence  of  38%.  This  observed  high  penetrance 
of  HPV  in  lung  cancer  suggests  more  research  is  required  to  elucidate  its  role  in  lung  cancer 
pathogenesis  however,  considering  the  significant  variation  observed  between  studies  of  cases 
from  the  same  geographical  location  subsequent  studies  will  need  to  have  large  sample  and  a 
detailed  study  design. 

Conclusion 

Genetic  and  epigenetic  mechanisms  underlying  lung  cancer  development  and  progression 
continue  to  emerge.  Over  the  past  decade,  research  into  the  biology  of  many  diseases  has  been 
spearheaded  by  the  development  of  whole-genome  microarray  technology,  allowing  the 
simultaneous  analysis  of  expression,  copy-number  and  SNPs  across  thousands  of  genes.  In  lung 
cancer,  gene  expression  studies  have  uncovered  novel  genes  and  pathways,  as  well  as  identified 
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gene  signatures  that  can  better  predict  patient  prognosis,  response  to  treatment,  and  histology 
(165-168),  reviewed  (169,  170).  High  resolution  mapping  of  alterations  in  the  lung  cancer 
genome  has  been  able  to  identify  single  genes  as  targets  of  genomic  gain  or  loss  through 
improved  definition  of  known  aberrant  regions  or  by  identification  of  focal  alterations 
undetectable  with  earlier  technology  (15,  171-173).  Large-scale  sequencing  and  SNP  analyses 
have  also  led  to  the  identification  of  novel  somatic  mutations  or  SNPs  in  the  lung  cancer  genome 
(7-9,  174).  While  such  genome-wide  screens  have  the  capacity  of  identifying  novel  genes  or 
interactions  in  relation  to  lung  cancer,  the  functional  relevance  of  these  findings  still  need  to  be 
elucidated  using  in  vitro  model  systems  such  as  tumor  cell  lines  or  immortalized  human 
bronchial  epithelial  cells.  These  systems  allow  the  characterization  of  single  or  sequential 
genetic  alterations  in  relation  to  the  development,  maintenance,  and  progression  of  lung  cancer 
and  represent  a  crucial  contribution  in  the  understanding  of  the  molecular  biology  of  lung  cancer. 
Functional  characterization  of  genetic  alterations  and  the  signaling  pathways  with  which  they 
interact,  has  enabled  the  development  of  targeted  therapies  for  the  treatment  of  lung  cancer 
(Table  2).  Ranging  from  drugs  in  clinical  use  to  those  in  clinical  trial,  they  are  directed  against 
all  known  pathways  of  lung  cancer  initiation  and  progression  such  as  proliferation,  inhibition  of 
apoptosis,  angiogenesis,  and  invasion.  This  chapter  has  outlined  some  of  the  significant 
molecular  alterations  known  to  be  involved  in  the  initiation  and/or  progression  of  lung  cancer. 
By  characterizing  these  aberrations  researchers  endeavor  to  improve  the  detection,  diagnosis, 
treatment,  and  prognosis  of  lung  cancer  through  the  integration  of  clinical  and  biological  factors 
-  to  achieve  personalized  medicine. 
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Figure  1:  Signaling  pathways  involved  in  NSCLC  and  SCLC.  Aberrant  signaling  resulting 


in  activation  of  growth  stimulatory  pathways  or  interference  of  growth  inhibitory  pathways  has 
been  implicated  in  lung  cancer  pathogenesis.  Oncogenic  activation  typically  occurs  through 
amplification,  mutation,  rearrangement  or  over-expression.  In  lung  cancer,  commonly  activated 
oncogenes  include  EGFR,  ERBB2 ,  MYC,  KRAS,  MET,  CCND1,  CDK4  and  BCL2.  In  contrast  to 
oncogene  activation,  loss  of  TSG  function  is  thought  to  require  inactivation  of  both  alleles  - 
generally,  LOH  of  one  allele,  and  point  mutation,  epigenetic  or  transcriptional  silencing  will 
inactivate  the  second  allele.  In  lung  cancer,  commonly  inactivated  TSGs  include  TP53,  RBI, 
CDKN2A,  FHIT  and  PTEN.  Although  EGFR  plays  a  major  role  in  lung  cancer  pathogenesis, 
several  other  receptor  tyrosine  kinases  have  been  implicated  such  as  members  of  the  platelet- 
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derived  growth  factor  receptor  (PDGFR)  family  c-KIT  (expression  is  common  in  SCLC  but  rare 
in  NSCLC  (175,  176))  and  MET  (potentially  associated  with  acquired  resistance  to  EGFR  TKIs). 
Table  2  lists  targeted  therapeutic  agents  which  have  been  developed  against  many  components  of 
these  signaling  pathways. 
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Hedgehog  pathway 


Wnt  pathway 


Notch  pathway 


Figure  2:  Stem  cell  self-renewal  pathways  and  therapeutic  strategies  to  block  these 
pathways  in  cancer.  Notch,  Wnt,  and  Hedgehog  (HH)  are  stem  cell  self-renewal  pathways  that 
are  often  deregulated  and  aberrantly  activated  in  lung  cancer,  thus  representing  key  therapeutic 
targets.  Modulation  of  these  pathways  can  be  achieved  at  different  levels.  In  general,  it  is 
possible  to  interfere  with  ligand-receptor  interactions  by  using  ligand  antagonists  or  receptor 
decoys,  by  blocking  ligand-induced  conformational  changes,  or  disrupting  protein-protein 
interaction  of  complexes  involved  in  the  activation  of  nuclear  transcription.  More  specifically, 
strategies  targeting  the  HH  pathway  include  the  use  of  antagonists  for  ligands  (HH)  or  receptors 
(Smoothened,  SMO)  as  well  as  cyclopamine,  a  naturally  occurring  compound  that  inhibits  SMO. 
Antibodies  against  WNT  ligands  or  the  receptor  FZD  and  inhibitors  of  protein  complex 
formation  can  be  used  to  inhibit  the  Wnt  pathway.  Anti-ligand  antibodies  are  used  as  potential 
agents  to  block  the  Notch  pathway.  Cleavage  of  Notch  receptors  by  ADAM  proteases  and  y- 
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secretase  is  required  to  activate  this  pathway  and  inhibitors  of  these  enzymes  are  being  tested  for 
their  possible  therapeutic  implications.  At  present,  two  drugs  targeting  these  pathways  are  in 
early  clinical  trials  for  treatment  of  some  cancer  types:  phase  I  trials  are  studying  the  effects  of  a 
HH  antagonist  (GDC-0449)  on  solid  tumors  and  MK-0752,  a  y-secretase  inhibitor,  is  being  tested 
in  patients  with  T-cell  acute  lymphoblastic  leukemia,  breast  cancer,  and  central  nervous  system 
tumors.  Pathways  are  depicted  schematically  and  some  components  were  omitted  (dashed  lines) 
for  simplicity.  ADAM,  a  disintegrin  and  metalloprotease  domain;  APC,  adenomatous  polyposis 
coli;  P-Cat,  P-catenin;  CSKla,  cyclin  suppressing  kinase  la;  CSL,  C-promoter  binding  factor  1, 
suppressor  of  hairless,  and  Lagl  protein  complex;  DSH,  disheveled;  FZD,  frizzled;  GLI,  glioma- 
associated  oncogene;  GLIACT,  active  form  of  GLI;  GLIREP,  repressor  form  of  GLI;  GSK3p, 
glycogen  synthase  kinase  3P;  HH,  hedgehog;  NICD,  Notch  intracellular  domain;  P, 
phosphorylation;  PTCH,  patched;  SMO,  smoothened;  TACE,  TNF-a-converting  enzyme; 
TCF/HNF4A,  transcription  factor. 
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Table  1:  Common  genetic  alterations  found  in  lung  cancer3 

Gene 

SCLC  (%) 

All 

NSCLC  (%) 
Adeno¬ 
carcinoma 

Squamous 

cell 

References 

Oncogenic  alterations 

Mutation 

BRAF 

Rare 

1-3 

1-5 

3 

(177,  178) 

EGFR 

Rare 

~20 

10-40 

Rare 

(177,  179-182) 

ErbB2  (HER2) 

Rare 

2 

4 

Rare 

(177,  183) 

KRAS 

Rare 

10-30 

15-35 

<5 

(177,  184-186) 

MET 

13 

21 

14 

12 

(6) 

PIK3CA 

Rare 

1-5 

2-3 

2-7 

(56,  187-189) 

Amplification 

EGFR 

Rare 

20-30 

15 

30 

(6) 

ErbB2  (HER2) 

5-30 

2-23 

6 

2 

(6,  183,  190,  191) 

MDM2 

6-24 

14 

22 

(192) 

MET 

7-21 

20 

21 

(193,  194) 

MYC 

18-30 

8-22 

(195-198) 

NKX2-1  (TITF1) 

Rare 

12-30 

10-15 

3-15 

(6,  14,  15) 

PIK3CA 

~5 

9-17 

6 

33-36 

(6,  56) 

Increase  in  protein  expression 

CRK 

8-30 

8-30 

(199) 

BCL2 

75-95 

10-35 

(186,  200,  201) 

CCND1 

0 

43 

35-55 

30-35 

(85,  202) 

CD44 

Rare 

Common 

3 

48 

(203) 

c-KIT 

46-91 

Rare 

(204-210) 

EGFR 

Rare 

50-90 

40-65 

60-85 

(29-32,  186) 

ErbB2  (HER2) 

<10 

20-35 

16-38 

6-16 

(183,  186,  207,  211-213) 

MYC 

10-45 

<10 

(50,  214-216) 

PDGFRA 

65 

2-100 

100 

89 

(217-220) 

Tumor  suppressing  alterations 

Mutation 

CDKN2A  (pi  6) 

<1 

10-40 

(186) 

LKB1 

Rare 

30 

34 

19 

(6,  186) 

p53 

75-90 

50-60 

50-70 

60-70 

(186,  221-223) 

PTEN 

15-20 

<10 

(186) 

Rb 

80-100 

20-40 

(186,  224-226) 

Deletion/LOHb 

CDKN2A  (pi  6) 

37 

75-80 

(84,  227,  228) 

FHIT 

100 

55-75 

(227-229) 

p53 

86-93 

74-86 

(227,  228) 

Rb 

93 

62 

(227,  228) 

Loss  of  protein  expression 

CAV1 

95 

24 

(230) 

CDKN2A(p14ARF) 

65 

40-50 

(84,  231) 

CDKN2A  (pi  6) 

3-37 

30-79 

~55 

60-75 

(227,  228) 

FHIT 

80-95 

40-70 

(186,  227,  228) 

PTEN 

25-74 

77 

70 

(60,  231) 

37 


Rb 

90 

15-60 

23-57 

6-14 

(228) 

TUSC2  (FUS1) 

100 

82 

79 

87 

(232) 

Tumor-acquired  DNA  Methylation 

APC 

15-26 

24-96 

(93,  94,  233) 

CAV1 

93 

9 

(230) 

CDH1 

60  40 

20-35 

(94,  233-235) 

CDH13 

15-20 

45 

(93,  94) 

CDKN2A  (p14ARF) 

ndb 

6-8 

(94) 

CDKN2A  (pi  6) 

5,  0 

15-41 

21-36 

24-33 

(236-238) 

DAPK1 

nd 

16-45 

(94,  233,  239) 

FHIT 

64 

37 

(93,  94) 

GSTP1 

16 

7-12,  15 

(94,  240) 

MGMT 

16 

16-27,  10 

(94,  233) 

PTEN 

26 

24 

30 

(60) 

RARp 

45-70 

40-43 

(93,  94,  241) 

RASSF1A 

72-85 

15-45 

31 

43 

(90,  94,  96,  233,  242,  243) 

SEMA3B 

nd 

41-50 

46 

47 

(90,  91) 

TIMP3 

nd 

19-26 

(94) 

Telomeres 

Telomerase  activity 

75-100 

50-80 

65-85 

80-90 

(11-13,  186,  244) 

Chromosomal  aberrations 

Large-scale  loss 

Ip,  3p, 

3p,  5q,  8p, 

2q,  3p,  4q, 

3p,  4q,  9p, 

(24,  25,  55,  227,  245-248) 

4p,  4q, 

9p,  13q, 

8p,  9p,  9q, 

lOp,  lOq, 

5q,  8p, 

17p,  18q, 

1 0p,  1 0q, 

18,  20 

1 Oq, 13q, 

19p,  19q, 

13q, 15q, 

17p 

21q,  22q 

18,  20 

Focal  deletions 

2q22.1 , 3p14.2,  3q25.1,  5q1 1 .2, 

7q1 1 .22,  7q34,  9p23,  9p21.3, 

10q23.31,  1 1  q  1 1 ,  13q12.11,  13q14.2, 

(15,  171,  172) 

13q32.2,  18q23,  21  pi  1.2 

Large-scale  gain 

3q,  5p, 

1  q,  3q,  5p, 

5p,  7p,  7q, 

2q,  3q,  5p, 

(24,  25,  55,  227,  245-248) 

8q,  18q 

6p,  7p,  7q, 

8q,  1 1  q. 

7,  8q,  11q, 

8q,  20p, 

19,  20q 

13q,  19, 

20q 

20q 

Focal 

1  p36.32,  1p34.3,  1q32.2,  1q21 .2, 

(15,  171,  172) 

amplifications 

2p24.3,  2q1 1 .2,  2q31 .1 ,  3q26.31 , 
5p15.33,  5p15.31,  5p14.3,  5q31.3, 

6p21 .1 , 7p1 1 .2,  8p12,  8q21 .13, 

8q24.21 ,  10q24.1,  10q26.3,  11q13.3, 

1 2p1 2.1 ,  1 2q  1 3.2,  12q14.1,  12q15, 

1 4q  1 3.3,  14q32.13,  16q22.2,  17q12, 

1 8q  12.1,  1 9q  1 2,  19q13.33,  20q13.32, 

22q1 1 .21 

a  nd,  not  determined 
b  LOH,  loss  of  heterozygosity 
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Table  2:  Targeted  therapies  against  oncogenic  pathways  in  lung  cancer  and 
their  development  in  clinical  trials 

Gene 

Drug 

NSCLC3 

SCLCa 

EGFR 

cetuximab 

ll/lll 

lb 

EGFR 

panitumumab 

II 

net 

EGFR 

matuzumab 

II 

net 

EGFR 

gefitinib 

approved 

II 

EGFR 

erlotinib 

approved 

net 

EGFR,  ErbB2 

lapatinib 

II 

net 

EGFR,  ErbB2 

HKI-272 

II 

net 

EGFR,  ErbB2 

CI-1033 

II 

net 

ErbB2 

trastuzumab 

II 

lb 

VEGF 

bevacizumab 

approved 

II 

VEGFR 

cediranib 

ll/lll 

II 

VEGFR,  EGFR 

vandetanib 

ll/lll 

II 

VEGFR,  PDGFR,  c-KIT 

sunitinib 

ll/lll 

II 

VEGFR,  PDGFR,  c-KIT 

vatalanib 

II 

net 

VEGFR,  PDGFR,  c-KIT 

axitinib 

II 

net 

VEGFR,  PDGFR,  c-KIT 

AMG-706 

ll/lll 

net 

c-KIT,  PDGFR 

Imatinib 

II 

II 

RAS 

tipifarnib 

II 

II 

RAS 

lonafarnib 

III 

net 

RAF,  VEGFR,  PDGFR,  c-KIT 

sorafenib 

ll/lll 

II 

MEK 

CI-1040 

II 

net 

MEK 

PD-325901 

II 

net 

MEK 

AZD6244 

II 

net 

PI3K 

LY294002 

net 

net 

mTOR 

sirolimus 

I/ll 

net 

mTOR 

temsirolimus 

II 

II 

mTOR 

everolimus 

I/ll 

I/ll 

mTOR 

AP23573/deforolimus 

lb 

lb 

BCL2 

oblimersen 

ll/lll 

I/ll 

BCL2 

ABT-737 

net 

net 

SRC 

dasatinib 

II 

II 

Proteasome 

bortezomib 

II 

II 

Proteasome 

NPI-0052 

1 

net 

p53 

p53  peptide  vaccine 

II 

lb 

FUS1 

fusl  liposome  complex 

1 

net 

HDACs 

vorinostat 

II 

I/ll 

HDACs 

romidepsin 

II 

II 

Telomerase 

GRN163L 

1 

lb 

a  I,  Phase  I  clinical  trial;  II,  Phase  II  clinical  trial;  III,  Phase  III  clinical  trial;  approved,  approved  by  the  FDA; 
net,  not  in  a  clinical  trial  at  time  of  manuscript  preparation 
b  In  Phase  I  clinical  trial  for  solid  tumors,  not  specific  to  NSCLC/SCLC 
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Three  recent  genome-wide  association  studies  identified  associations  between 
markers  in  the  chromosomal  region  15q24-25.1  and  the  risk  of  lung  cancer.  We  con¬ 
ducted  a  genome-wide  association  analysis  to  investigate  associations  between 
single-nucleotide  polymorphisms  (SNPs)  and  the  risk  of  lung  cancer,  in  which  we 
used  blood  DNA  from  194  case  patients  with  familial  lung  cancer  and  219  cancer-free 
control  subjects.  We  identified  associations  between  common  sequence  variants  at 
15q24-25.1  (that  spanned  LOCI 23688  [a  hypothetical  gene],  PSMA4,  CHRNA3, 
CHRNA5,  and  CHRNB4)  and  lung  cancer.  The  risk  of  lung  cancer  was  more  than  five¬ 
fold  higher  among  those  subjects  who  had  both  a  family  history  of  lung  cancer  and 
two  copies  of  high-risk  alleles  rs8034191  (odds  ratio  [OR]  =  7.20,  95%  confidence 
interval  [Cl]  =  2.21  to  23.37)  or  rs1051730  (OR  =  5.67,  Cl  =  2.21  to  14.60,  both  of  which 
were  located  in  the  15q24-25.1  locus,  than  among  control  subjects.  Thus,  further 
research  to  elucidate  causal  variants  in  the  15q24-25.1  locus  that  are  associated  with 
lung  cancer  is  warranted. 
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Lung  cancer  can  occur  sporadically  in  peo¬ 
ple  with  no  known  family  history  of  lung 
cancer  or  it  can  occur  in  multiple  members 
of  the  same  family  and  be  designated  as 
familial  lung  cancer.  Evidence  of  a  genetic 
basis  for  susceptibility  to  lung  cancer  has 
been  demonstrated  through  genome-wide 
association  studies  (1-3)  and  segregation 
analyses  (4-9).  We  conducted  a  genome¬ 
wide  association  study  among  individuals 
with  a  familial  history  of  lung  cancer. 
These  individuals  are  members  of  families 
with  three  or  more  first-degree  relatives 
with  lung  cancer  that  were  collected  as  part 
of  the  Genetic  Epidemiology  of  Lung 
Cancer  Consortium  (GELCC). 

For  the  genome-wide  association  study, 
we  genotyped  194  case  patients  with  famil¬ 
ial  lung  cancer  and  219  cancer- free  control 
subjects  by  use  of  the  Affymetrix  500K  or 
Affymetrix  Genome-Wide  Human  SNP 
Array  6.0  (Santa  Clara,  CA)  (Supplementary 
Table  1,  available  online).  To  ensure  genetic 


independence  among  subjects,  one  case 
patient  with  familial  lung  cancer  was  cho¬ 
sen  from  each  high-risk  lung  cancer  family. 
Noncancer  control  subjects  were  obtained 
from  a  combination  of  unaffected  spouses 
from  GELCC  families  (n  =  36)  and  of  unaf¬ 
fected  individuals  from  the  Coriell  Institute 
for  Medical  Research  (Camden,  NJ)  (n  =  11) 
and  the  Fernald  Medical  Monitoring  Pro¬ 
gram  (Fernald,  OH)  (n  =  172).  These  control 
subjects  had  no  blood  relationship  with  any 
selected  case  patients.  To  minimize  possi¬ 
ble  effects  of  cigarette  smoking  and  age, 
smokers  with  an  older  age  were  selected 
mainly  as  control  subjects,  except  for  spousal 
control  subjects  (Supplementary  Table  1, 
available  online).  To  maintain  homogeneous 
population  samples,  only  Caucasian 
subjects  from  the  GELCC,  Coriell  Insti¬ 
tute  for  Medical  Research,  and  the  Fernald 
Medical  Monitoring  Program  were  used  for 
the  association  analysis.  Basic  characteristics 
of  the  GELCC  subjects  are  presented  in 


Supplementary  Table  1  (available  online). 
The  Texas  and  UK  lung  cancer  cohorts 
have  been  described  previously  (1).  Written 
informed  consent  and  institutional  review 
board  approval  were  obtained  for  all  the 
subjects  involved  in  this  study. 

Single-nucleotide  polymorphism  (SNP) 
genotyping  was  done  by  the  Vanderbilt 
University  Microarray  Shared  Resource  by 
following  the  Affymetrix  protocol  (www. 
affymetrix.com).  We  used  the  Affymetrix 
500K  chipset  to  genotype  76  control  sub¬ 
jects  and  73  case  patients  and  the  Affymetrix 
SNP  6.0  array  to  genotype  the  remaining 
137  case  patients  and  164  control  subjects. 
The  samples  used  with  the  two  platforms 
were  randomly  chosen  from  the  GELCC 
collections.  The  Affymetrix  6.0  array 
includes  more  than  906  600  SNPs  and  cov¬ 
ers  97.2%  of  SNPs  presented  in  the 
Affymetrix  500K  chipset  of  500568  SNPs. 
For  the  Affymetrix  500K  SNP  chipset,  a 
confidence  score  of  0.33  was  used  for  geno¬ 
type  determination,  by  using  the  Bayesian 
robust  linear  model  with  the  Mahalanobis 
algorithm  (10);  an  average  genotyping  call 
rate  of  96.9%  was  obtained  across  all  case 
and  control  samples.  For  the  SNP  Array 
6.0,  a  block  size  of  zero  and  a  confidence 
threshold  of  0.1  were  used  for  genotype 
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CONTEXT  AND  CAVEATS 

Prior  knowledge 

Genome-wide  association  studies  have 
identified  associations  between  markers  in 
the  chromosomal  region  15q24-25.1  and 
the  risk  of  lung  cancer. 

Study  design 

A  genome-wide  case-control  association 
analysis  was  used  to  investigate  relation¬ 
ships  between  single-nucleotide  polymor¬ 
phisms  (SNPs)  and  the  risk  of  familial  lung 
cancer. 

Contribution 

Subjects  with  both  a  family  history  of  lung 
cancer  and  two  copies  of  either  of  two  high- 
risk  alleles  in  15q24-25.1  had  a  higher  risk 
of  lung  cancer  than  control  subjects. 

Implications 

Additional  research  is  required  to  identify 
which  genetic  variants  in  the  15q24-25.1 
region  are  associated  with  a  high  risk  of 
lung  cancer. 

Limitations 

Associations  of  risk  alleles  with  nicotine 
dependence  were  not  directly  tested 
because  the  data  were  not  available. 
Smoking  quantity  was  available;  however, 
no  association  between  smoking  quantity 
and  the  high-risk  alleles  was  found.  The 
small  sample  size  may  have  limited  the 
ability  to  detect  a  smaller  effect  size  for  risk 
alleles  among  heterozygotes  with  familial 
lung  cancer. 

From  the  Editors 


determination,  by  using  the  Birdseed  algo¬ 
rithm  (available  at  http://www.broad.mit.edu/ 
mpg/birdsuite/birdseed.html).  We  excluded 
four  samples  with  unexpected  genetic  rela¬ 
tedness  that  was  detected  by  PLINK  soft¬ 
ware  (http://pngu.mgh.harvard.edu/~purcell/ 
plink/)  and  33  samples  with  a  genotype 
call  rate  of  less  than  86%.  Thus,  a  total  of 
413  samples  were  available  for  our  analysis 
(Supplementary  Table  1,  available  online) — 
57  case  patients  and  56  control  subjects 
genotyped  with  the  Affymetrix  500K  chip- 
set,  and  137  case  patients  and  163  control 
subjects  genotyped  with  the  Affymetrix 
SNP  6.0  array.  When  the  frequency 
distribution  of  the  overlapping  SNPs 
in  the  two  genotyping  platforms  was  ana¬ 
lyzed,  no  heterogeneity  between  the 
platforms  was  found  (Supplementary 
Figure  1,  available  online).  Among  the  413 


subjects,  we  detected  no  population  strati¬ 
fication,  as  shown  by  linkage  agglomerative 
clustering  implemented  in  PLINK  software. 
Hardy-Weinberg  equilibrium  for  each  SNP 
was  examined  with  the  software  hweStrata, 
by  use  of  an  exact  test  as  proposed  previ¬ 
ously  (11)  (stratum  number  K  =  1).  All  statis¬ 
tical  tests  were  two-sided.  SNPs  with  an 
exact  P  value  of  .0001  or  less  from  Hardy- 
Weinberg  equilibrium  tests  and  with  a 
minor  allele  frequency  of  less  than  1% 
among  control  subjects  were  excluded  from 
association  analyses.  Thus,  in  the  final 
association  analysis  of  413  samples,  we 
retained  722  376  SNPs  for  the  300  samples 
that  were  genotyped  by  SNP  Array  6.0  and 
399  377  SNPs  for  the  113  samples  that 
were  genotyped  by  Affymetrix  500K.  The 
statistical  significance  of  the  association 
between  SNP  allele  and  disease  status  was 
assessed  primarily  with  Fisher  exact  tests 
(12)  (Supplementary  Figure  2,  available 
online).  The  odds  ratios  (ORs)  of  lung  can¬ 
cer  associated  with  each  SNP  and  95% 
confidence  intervals  (CIs)  were  estimated 
by  allele  and  by  genotype.  We  evaluated 
the  cumulative  distribution  of  P  values  from 
Fisher  exact  tests  in  our  genome-wide  asso¬ 
ciation  study.  As  shown  in  Supplementary 
Figure  3  (available  online),  the  distribution 
of  observed  P  values  was  similar  to  the 
expected  uniform  distribution  [0,  1],  indi¬ 
cating  no  inflation  of  test  statistics  from 
population  structure  or  any  other  form  of 
bias. 

Our  genome-wide  association  study  iden¬ 
tified  several  strong  associations  of  SNPs  on 
chromosomes  1,  3,  6,  9,  12,  15,  and  20  with 
lung  cancer.  One  of  the  most  strongly 
associated  clusters  of  SNPs  was  found  in  a 
160-kb  region  at  15q24-25.1  (Figure  1,  A  and 
B,  and  Supplementary  Figure  1,  available 
online)  that  had  strong  linkage  disequilib¬ 
rium  and  contained  multiple  discrete  hap- 
lotype  blocks  (ie,  regions  with  no  evidence 
of  historical  recombination),  as  defined  by 
examining  the  HapMap  data  from  Utah 
residents  with  ancestry  from  northern  and 
western  Europe  (http://www.hapmap.org/) 
(Figure  1,  C).  These  familial  data  confirm 
the  recently  described  association  between 
15q25.1  and  sporadic  lung  cancer  (1-3).  At 
least  two  common  variants  on  15q24-25. 1, 
SNPs  rs8034191  and  rsl051730,  were 
strongly  associated  with  both  familial  (P  = 
3.74  x  PL4  for  rs8034191  and  P  =  2.90  x 
10“4  for  rs  105 1730)  and  sporadic  (from  the 


Texas  study,  P  =  9.71  x  10  8  for  rs8034191 
and  P  =  1.32  x  10"7  for  rsl051730;  from  the 
UK  study,  P  =  9.90  x  10  9  for  rs8034191 
and  P  =  2.84  x  10-8  for  rsl051730)  lung 
cancer  (Table  1).  These  SNPs  are  in  high 
linkage  disequilibrium  (r2  =  0.87)  and  are  loca¬ 
ted  within  intronic  regions  of  hypothetical 
gene  LOC123688  (rs8034191)  and  of  the 
gene  CHRNA3  (rs  105 1730).  Risk  of  lung 
cancer  among  homozygous  carriers  was 
statistically  significantly  different  from  risk 
among  noncarriers  for  the  two  SNPs — 
rs8034191  and  rs  105 1730 — in  large  spo¬ 
radic  lung  cancer  samples  previously 
reported  from  Texas  and  from  the  United 
Kingdom  (1). 

It  is  worth  noting  that  the  effect  size  of 
the  risk  allele  at  15q24-q25.1  observed  in 
familial  lung  cancer  (for  carrying  two  cop¬ 
ies  of  risk  alleles  of  rs8034191  and 
rs  105 1730,  respectively,  OR  =  3.84,  95% 
Cl  =  1.75  to  8.84,  and  OR  =  3.43,  95% 
Cl  =  1.66  to  7.37)  was  larger  than  that 
observed  in  sporadic  lung  cancer  (for  car¬ 
rying  two  copies  of  the  same  risk  alleles, 
respectively,  OR  =  1.76,  95%  Cl  =  1.42  to 
2.18,  and  OR  =  1.75,  95%  Cl  =  1.42  to 
2.17,  in  the  Texas  study;  and  OR  =  2.09, 
95%  Cl  =  1.60  to  2.73,  and  OR  =  2.05, 
95%  Cl  =  1.57  to  2.70,  in  the  UK  study) 
(Table  1).  We  therefore  tested  heteroge¬ 
neity  between  familial  and  sporadic  lung 
cancers  by  Woolf’s  test  (13).  Statistically 
significant  heterogeneity  was  observed 
between  familial  and  the  Texas  sporadic 
lung  cancer  samples  (P  =  .04)  but  not 
between  familial  and  UK  sporadic  lung 
cancer  samples  (P  =  .12). 

To  rule  out  confounding  effects  of 
smoking  behavior  on  lung  cancer  risk,  the 
association  analysis  was  adjusted  by  sex, 
age,  and  pack-years  of  cigarette  exposure, 
as  continuous  variables  (Supplementary 
Table  2,  available  online).  Association  of 
the  15q24-25.1  locus  with  familial  lung 
cancer  remained  statistically  significant 
(P=  1.03  x  10“3  forrs8034191  andP=  3.10  x 
10“4  for  rs  105 1730)  after  this  adjustment. 
The  adjusted  associations  were  even  stron¬ 
ger  among  those  in  lung  cancer  families  car¬ 
rying  two  copies  of  risk  alleles  of  rs8034191 
(OR  =  7.20,  95%  Cl  =  2.21  to  23.37)  or 
rsl051730  (OR  =  5.67,  95%  Cl  =  2.21  to 
14.60).  Thus,  among  lung  cancer  patients 
with  a  family  history,  alleles  associated  with 
a  high  risk  of  lung  cancer  appear  to  be 
located  on  the  q  arm  of  chromosome  15. 
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We  next  investigated  whether  the  SNPs 
on  15q24-q25.1  that  were  strongly  associ¬ 
ated  with  lung  cancer  acted  additively, 
recessively,  or  dominantly  by  use  of  logistic 
regression  and  Bayesian  information  crite¬ 
ria.  Given  any  two  estimated  models,  the 
model  with  the  lower  value  of  Bayesian 
information  criteria  is  the  one  to  be  pre¬ 
ferred.  The  strongest  associations  were 
found  with  the  recessive  model  (Table  1 
and  Supplementary  Table  3,  available 
online).  In  the  recessive  model,  an  increased 


risk  of  lung  cancer  was  associated  with 
having  two  copies  of  the  high-risk  A  allele 
of  rs  105 1730,  compared  with  having  no 
copies  (OR  =  3.43,  95%  Cl  =  1.66  to  7.37, 
P  value  =  2.90  x  10~4).  However,  an  additive 
model  gave  similar  levels  of  statistical  sig¬ 
nificance  for  many  of  the  most  statistically 
significant  SNPs  in  this  region.  The  homo¬ 
zygous  risk  genotype  appeared  to  occur 
more  frequently  and  to  have  larger  effect 
size  among  familial  lung  cancer  samples 
(OR  =  3.84  for  rs  105 1730)  than  among 
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Figure  1.  Association  between  chromosome  15q24-25.1  and  lung  cancer.  Associations  are 
expressed  as  -log(P);  P  values  were  from  Fisher  exact  tests.  All  statistical  tests  were  two-sided. 

A)  Association  analysis  of  194  familial  case  patients  and  219  control  subjects  in  chromosome  15. 

B)  Enhanced  view  of  association  analysis  in  the  75-  to  79-Mb  region  of  chromosome  15.  Several 
common  variants,  including  rs8034191  and  rs1051730,  cluster  on  15q24-25.1  and  show  strong 
associations  with  familial  lung  cancer.  C)  Physical  map  and  haplotype  blocks  (ie,  regions  with  no 
evidence  of  historical  recombination).  Pairwise  linkage  disequilibrium,  measured  as  D'  (a  mea¬ 
surement  of  the  nonrandom  association  of  alleles  at  two  loci),  was  calculated  with  HapMap  data 
from  Utah  residents  with  ancestry  from  northern  and  western  Europe  by  the  methods  of  Gabriel 
(21)  as  implemented  with  Haploview  software  (22).  Shading  represents  the  magnitude  and  sta¬ 
tistical  significance  of  the  pairwise  linkage  disequilibrium,  with  a  white-to-red  gradient  reflecting 
lower  to  higher  linkage  disequilibrium  values.  The  single-nucleotide  polymorphisms  rs8034191 
and  rs1051730  were  contained  in  the  84-kb  block  4. 


sporadic  lung  cancer  samples  (OR  =  1.75 
for  rs  1051730  from  the  Texas  study  and 
OR  =  2.09  for  rs  105 1730  in  the  UK  study). 
The  smaller  sample  size  of  the  familial 
study  was  less  powerful  to  detect  the 
smaller  risks  among  hetero zygotes.  This 
limitation  and  the  higher  frequency  of 
homozygotes  in  the  highly  ascertained 
families  with  familial  lung  cancer  may 
account  for  the  difference  in  best  fitting 
models  at  the  15q24  locus  in  familial  vs 
sporadic  data  sets.  A  genome-wide  associa¬ 
tion  study  on  a  larger  sample  of  familial 
case  patients  would  provide  increased 
power.  However,  both  familial  and  spo¬ 
radic  analyses  support  the  presence  of  a 
risk  allele  at  15q24-25.1. 

In  addition  to  analyses  of  single  SNPs, 
we  performed  a  haplotype-based  associa¬ 
tion  analysis  on  the  1 5q24-q25. 1  locus. 
Haplotypes  at  the  15q24-25.1  locus  region 
were  inferred  by  use  of  the  fastPHASE 
computer  program  (14).  To  exhaustively 
exploit  haplotype  information,  we  then 
subject  alleles  (contiguous  sets  of  markers) 
from  sliding  windows  of  all  sizes  to  haplo¬ 
type  association  tests  (15).  In  the  haplotype 
analysis,  a  common  haplotype  with  a  fre¬ 
quency  larger  than  5  %  was  first  identified 
and  treated  as  one  category  and  the  other 
variants  as  another  category.  The  Fisher 
exact  test  was  then  applied  to  determine 
statistical  significance  (12).  Multiple  haplo¬ 
types  at  the  15q24-25.1  locus  were  statisti¬ 
cally  significantly  associated  with  familial 
lung  cancer  risk  (Supplementary  Table  4, 
available  online).  The  most  statistically  sig¬ 
nificant  association  was  with  the  haplotype 
A-T  consisting  of  both  rs7163730  and 
rs4461039  (P  =  1.03  x  10“4).  In  the  single¬ 
marker  analyses,  the  two  SNPs  with  the 
most  statistically  significant  association  with 
lung  cancer  were  rs7163730  (P  =8.71  x  10“5) 
and  rs4461039  (P  =  8.67  x  10"5). 

Using  GELCC  familial  lung  cancer 
samples  and  a  genome-wide  association 
study,  we  identified  several  common  vari¬ 
ants  at  1 5  q2 4-2  5 . 1  and  confirmed  the  asso¬ 
ciation  between  genetic  variants  on  the  q 
arm  of  chromosome  15  and  sporadic  lung 
cancer  (1-3).  When  we  adjusted  for  smok¬ 
ing  and  other  covariates,  a  statistically  sig¬ 
nificant  association  remained  (Supplementary 
Table  2,  available  online).  This  observation 
is  consistent  with  previous  analyses  (1-3),  in 
which  a  direct  role  for  these  variants  in  lung 
cancer  was  postulated.  In  those  analyses,  the 
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Table  1.  Associations  of  the  single-nucleotide  polymorphisms  rs8034191  and  rs1051730  genotypes  with  lung  cancer* 


Samplest 

Genotype 

Frequency,  % 

Control  subjects  Case  patients 

P  value! 

OR  (95%  Cl) 

GELCC 

rs8034191 

AA 

50.00 

36.64 

AG 

41.67 

39.69 

.362 

1.30  (0.75  to  2.24) 

GG 

8.33 

23.66 

3.74  x  10“4 

3.84(1.75  to  8.84) 

rs1051730§ 

GG 

48.11 

37.57 

AG 

44.81 

43.35 

.377 

1.23  (0.78  to  1.96) 

AA 

7.08 

19.08 

2.90  x  10“4 

3.43  (1.66  to  7.37) 

Texas 

rs8034191 

AA 

44.10 

37.01 

AG 

44.85 

46.68 

3.09  x  10-3 

1.24  (1.07  to  1.43) 

GG 

11.05 

16.32 

9.71  x  10“8 

1.76  (1.42  to  2.18) 

rs1051730 

GG 

44.31 

36.82 

AG 

44.54 

46.95 

1.04  x  10“3 

1.27  (1.10  to  1.47) 

AA 

11.15 

16.23 

1.32  x  10“7 

1.75  (1.42  to  2.17) 

United  Kingdom 

rs8034191 

AA 

46.67 

36.59 

AG 

43.23 

46.86 

1.72  x  10“4 

1.38  (1.17  to  1.64) 

GG 

10.10 

16.55 

9.90  x  10-9 

2.09  (1.60  to  2.73) 

rs1051730 

GG 

46.55 

37.54 

AG 

43.72 

46.34 

1.40  x  10“3 

1.31  (1.11  to  1.56) 

AA 

9.73 

16.12 

2.84  x  IQ”8 

2.05  (1.57  to  2.70) 

*  GELCC  =  Genetic  Epidemiology  of  Lung  Cancer  Consortium;  OR  =  odds  ratio;  Cl  =  confidence  interval. 

t  The  GELCC  study  had  a  total  of  194  familial  case  patients  and  219  control  subjects.  The  Texas  study  had  a  total  of  1865  sporadic  case  patients  and  1769  control 
subjects.  The  UK  study  had  a  total  of  2013  sporadic  case  patients  and  3062  control  subjects.  The  Texas  and  UK  sporadic  lung  cancer  samples  are  from  the  study 
by  Amos  et  al.  (1 ). 

t  Fisher  exact  tests  were  performed.  All  statistical  tests  were  two-sided. 

§  rs1051730  was  genotyped  by  Human  Genetics  Division  Genotyping  Core. 


number  of  cigarettes  consumed  per  day, 
pack-years  of  exposure,  and  time  to  first 
cigarette  were  not  associated  with  SNPs  in 
the  region  (1,2).  In  contrast,  Thorgeirsson 
et  al.  (3)  reported  that  the  SNP  rsl051730 
was  statistically  significantly  associated  with 
lung  cancer  and  that  each  copy  of  the  T 
allele  was  associated  with  an  increase  in 
smoking  of  one  cigarette  per  day.  In  addi¬ 
tion,  Saccone  et  al.  (16)  identified  the  SNPs 
rs  16969968  (missense  in  CHRNA5)  and 
rs578776  (in  the  3 '-untranslated  region  of 
CHRNA3 )  as  functional  variants  that  were 
statistically  significantly  associated  with  nic¬ 
otine  dependence.  When  we  genotyped 
rs  16969968  and  rs578776  in  our  familial 
lung  cancer  population,  we  found  that  both 
were  statistically  significantly  associated 
with  lung  cancer  (for  rs  16969968,  P  =  2.29  x 
10“3;  for  rs578776,  P  =  4.47  x  IIP4).  Thus, 
polymorphisms  in  these  genes  may  affect 
nicotine  dependence  and  propensity  to 
smoke  and  to  develop  lung  cancer. 

Our  study  had  several  limitations.  First, 
the  association  of  risk  variants  with  nicotine 
dependence  (as  measured  by  the  Fagerstrom 
test  for  nicotine  dependence)  was  not  directly 


tested  because  these  data  were  unavailable. 
However,  smoking  quantity  (a  component  of 
the  Fagerstrom  test)  was  available.  We  did 
not  find  an  association  between  smoking 
quantity  and  the  statistically  significant  vari¬ 
ants  associated  with  lung  cancer.  Second,  the 
sample  size  in  our  familial  genome-wide 
association  analysis  was  small,  and  so  we 
might  not  have  been  able  to  detect  the 
smaller  effect  size  of  risk  alleles  located  in 
15q24-25.1  for  heterozygotes  in  familial  lung 
cancer. 

The  identity  of  variants  in  the  15q24- 
25.1  region  that  are  most  strongly  associated 
with  lung  cancer  remains  unknown  because 
many  SNPs  are  in  strong  linkage  disequilib¬ 
rium  with  each  other.  In  addition,  whether 
the  variants  associate  with  a  direct  or  indi¬ 
rect  mode  for  lung  cancer  remains  unre¬ 
solved.  From  results  of  this  and  three  other 
genome-wide  association  studies,  the  candi¬ 
date  genes  include  IREB2 ,  LOC123688, 
PSMA4 ,  CHRNAS,  CHRNA3 ,  and  CHRNB4 
(1-3).  CHRNA5 ,  CHRNA3 ,  and  CHRNB4 
are  strong  candidates  primarily  because 
they  encode  subunits  of  the  nicotinic  acetyl¬ 
choline  receptors.  These  genes  may  partici¬ 


pate  in  nicotinic  addiction  through  reward 
pathways  in  the  brain;  however,  evidence 
exists  that  nicotinic  acetylcholine  receptors 
are  associated  directly  with  lung  carcino¬ 
genesis  (17).  Nicotinic  acetylcholine  recep¬ 
tors  are  expressed  in  lung  cancers  of  both 
non-small  cell  and  small  cell  subtypes  (18), 
and  treatment  of  lung  cancer  cell  lines  with 
nicotine  can  inhibit  proapoptotic  pathways 
initiated  by  opioids  (19,20).  Therefore, 
determination  of  a  likely  single  candidate 
gene  and  further  delineation  of  whether 
variants  affect  lung  cancer  directly  or  indi¬ 
rectly  or  both  are  warranted. 
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DNA  amplification  is  a  ubiquitous  mechanism  of  oncogene  activation  in 
lung  and  other  cancers 
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Chromosomal  translocation  is  the  best-characterized 
genetic  mechanism  for  oncogene  activation.  However,  there 
are  documented  examples  of  activation  by  alternate 
mechanisms,  for  example  gene  dosage  increase,  though 
its  prevalence  is  unclear.  Here,  we  answered  the  funda¬ 
mental  question  of  the  contribution  of  DNA  amplification 
as  a  molecular  mechanism  driving  oncogenesis.  Comparing 
104  cancer  lines  representing  diverse  tissue  origins 
identified  genes  residing  in  amplification  ‘hotspots’  and 
discovered  an  unexpected  frequency  of  genes  activated  by 
this  mechanism.  The  3431  amplicons  identified  represent 
~10  per  hematological  and  ~36  per  epithelial  cancer 
genome.  Many  recurrently  amplified  oncogenes  were 
previously  known  to  be  activated  only  by  disease-specific 
translocations.  The  135  hotspots  identified  contain  538 
unique  genes  and  are  enriched  for  proliferation,  apoptosis 
and  linage-dependency  genes,  reflecting  functions  advanta¬ 
geous  to  tumor  growth.  Integrating  gene  dosage  with 
expression  data  validated  the  downstream  impact  of  the 
novel  amplification  events  in  both  cell  lines  and  clinical 
samples.  For  example,  multiple  downstream  components  of 
the  EGFR-family-signaling  pathway,  including  CDK5, 
AKT1  and  SHC1 ,  are  overexpressed  as  a  direct  result  of 
gene  amplification  in  lung  cancer.  Our  findings  suggest  that 
amplification  is  far  more  common  a  mechanism  of 
oncogene  activation  than  previously  believed  and  that 
specific  regions  of  the  genome  are  hotspots  of  amplification. 
Oncogene  advance  online  publication,  7  April  2008; 
doi:10.1038/onc.2008.98 

Keywords:  gene  amplification;  array  CGH;  gene  expres¬ 
sion;  integrative  analysis;  lung  cancer;  EGFR  signaling 


Introduction 

Genetic  aberration  and  the  consequential  activation  of 
oncogenes  are  key  to  cancer  development.  Chromosomal 
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translocation  is  known  as  the  major  event  in  oncogene 
activation  (Futreal  et  al .,  2004).  However,  the  preva¬ 
lence  of  alternate  mechanisms,  such  as  DNA  amplifica¬ 
tion,  have  not  been  extensively  quantified,  even  though 
oncogenes  have  been  found  in:  (i)  cytogenetically  visible 
double  minutes,  which  are  circular,  extrachromosomal 
elements,  a  few  megabases  in  size  that  replicate 
autonomously,  (ii)  homogenous  staining  regions,  which 
are  large  regions  of  tandem  repeats  within  a  chromo¬ 
some,  thought  to  be  formed  by  repeated  breakage- 
fusion-bridge  cycles  and  (iii)  discrete  insertions  distrib¬ 
uted  throughout  the  genome  (Albertson,  2006). 

Surprisingly,  relatively  few  oncogenes,  when  com¬ 
pared  to  chromosome  translocation,  have  been  shown  to 
undergo  amplification  as  mechanism  of  activation 
during  cancer  development.  In  fact,  a  recent  version 
(January  22,  2007)  of  a  census  of  genes  causally 
implicated  in  cancer  (cancer  genes)  originally  described 
by  Futreal  et  al.  (2004)  reported  only  seven  oncogenes 
meeting  their  criteria  as  being  recurrently  amplified  in 
the  development  of  human  cancers:  AKT2  in  ovarian 
cancer,  ERBB2  in  breast  and  ovarian  cancer,  MYCL1 
in  small  cell  lung  cancer,  MYCN  in  neuroblastoma, 
REL  in  Hodgkin  lymphoma,  epidermal  growth  factor 
receptor  {EGFR)  in  glioma  and  non-small  cell  lung 
cancer  (NSCLC),  and  MYC  in  numerous  cancers.  We 
propose  that  the  low  incidence  of  oncogenes  activated 
by  amplification  may  be  attributed  to  the  failure  of 
detection,  rather  than  governed  by  tumor  biology. 
Unlike  copy  number  gains,  which  are  generated  by 
aneueploidy  or  unbalanced  translocations  and  affect 
large  chromosomal  regions,  amplifications  are  tradi¬ 
tionally  defined  as  the  increase  of  chromosome  segments 
0.5-10  megabases  (Mb)  in  size  (Myllykangas  et  al ., 
2006).  The  small  size  of  amplicons  may  escape  detection 
by  conventional  cytogenetic  methods;  consequently,  the 
contribution  of  DNA  amplification  to  the  oncogenic 
process  may  be  grossly  underestimated.  With  advances 
in  high-resolution  whole-genome-profiling  technologies 
(Tonon  et  al.,  2005;  Garnis  et  al.,  2006),  the  complexity 
of  the  cancer  genome  is  becoming  evident,  and  the 
prevalence  of  DNA  amplification  as  a  mechanism  in  the 
activation  of  oncogenes  needs  to  be  re-evaluated. 

In  this  study,  we  determined  the  precise  boundaries  of 
amplified  chromosomal  segments  in  104  cancer  cell  lines 
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from  multiple  tissues  of  origin  and  deduced  novel 
regions  of  the  genome,  which  are  hotspots  for  genomic 
amplification.  These  hotspots  were  then  analysed  for 
their  association  with  genes  involved  in  tumorigenesis 
and  fragile  sites.  We  assessed  the  functional  impact  of  a 
subset  of  the  identified  hotspots  in  a  panel  of  NSCLC 
cell  lines  and  tumors  to  determine  their  effect  on  gene 
transcription  levels  and  their  contribution  to  the 
activation  of  cellular  pathways  potentially  involved  in 
lung  tumorigenesis. 


Results 

Identification  of  discrete  amplicons  in  cancer  genomes 
Twenty-four  thousand  eight  hundred  and  ninety-two 
genomic  loci  were  assessed  for  each  of  the  104  cell  lines, 
scanning  all  autosomes  at  a  resolution  of  ~  50  kb  (Coe 
et  al .,  2007).  Altogether,  3431  amplicons  were  detected 
across  all  samples  (see  Supplementary  Methods)  with  an 
average  size  of  0.68  Mb  and  a  median  of  0.33  Mb 
(Table  1,  Supplementary  Table  1).  The  number  of 
amplicons  per  genome  varied  from  0  to  199  with  an 
average  of  33.  Hematological  malignancies  (leukemia 
and  lymphomas)  had  ~  10  amplicons,  whereas  epithelial 
cancers  had  an  average  of  36. 


Unexpected  frequent  amplification  of  known  oncogenes 
The  most  recent  version  (January  22,  2007)  of  the 
Cancer  Gene  Consensus  of  the  Cancer  Genome  Project 
at  the  Sanger  Institute  (Futreal  et  al .,  2004)  contains  363 
cancer  genes  whose  aberration  are  causal  in  the 
development  of  specific  cancers.  Of  these,  70  are  tumor 
suppressor  genes,  292  are  oncogenes  and  one  can  act  as 
both.  Only  seven  (2%)  of  these  oncogenes  were  shown 
to  be  predominately  activated  by  amplification  com¬ 
pared  to  268  (92%),  which  are  activated  mainly  by 
chromosomal  translocation.  Our  data  showed  amplifi¬ 
cation  at  these  loci:  MYC  (28/104),  ERBB2  (10/104), 
EGFR  (7/104),  MYCL1  (6/104),  AKT2  (3/104)  and 
MYCN  (1/104).  REE  amplification  was  not  detected  in 


our  dataset,  as  Hodgkin  Lymphoma,  in  which  this  gene 
is  amplified,  is  not  represented  in  our  study. 

Unexpectedly,  145  of  the  292  oncogenes  (~50%) 
showed  amplification,  with  78  oncogenes  (27%)  at  ^2 
times  (Supplementary  Table  2).  Of  the  genes  amplified  in 
^5  cell  lines,  only  MFC,  ERBB2 ,  EGER  and  MYCL1 
have  been  reported.  The  frequent  amplification  of 
SS18L1 ,  NTRK1  and  PRDM16  are  novel  findings,  as 
translocation  was  the  known  mechanism.  Indeed, 
numerous  oncogenes,  which  are  primarily  activated  by 
translocation,  were  commonly  amplified  in  the  sample 
set  (Supplementary  Table  3).  The  number  of  oncogenes 
amplified  per  genome  also  varied  with  an  average  of  3.5 
genes.  Remarkably,  the  genomes  of  NSCLC  HCC1195 
(Supplementary  Figure  1)  and  SCLC  line  H526,  each 
harbor  22  amplified  oncogenes,  whereas  25  of  the  lines 
had  no  known  oncogenes  amplified  (Supplementary 
Table  4). 


Novel  hotspots  of  frequent  genomic  amplification 
in  cancer  genomes 

The  high  incidence  of  oncogene  amplification  per 
genome  suggested  that  this  is  a  common  mechanism  of 
gene  activation.  Therefore,  the  discovery  of  genomic 
regions  that  undergo  frequent  copy  number  amplifica¬ 
tion  may  lead  to  the  identification  of  novel  oncogenes. 
The  genomic  coordinates  of  all  amplicons  were  deter¬ 
mined  and  aligned  for  all  104  samples  (Figure  1).  DNA 
segments  amplified  ^5  times  were  stringently  consid¬ 
ered  as  hotspots;  they  are  found  in  —  5%  of  samples  (see 
Supplementary  Methods).  In  total,  135  hotspots  cover¬ 
ing  3%  of  genome  were  identified  with  an  average  size 
of  0.67  Mb.  Regions  of  genomic  amplification  were 
distributed  on  all  autosomes  except  chromosome  4,  and 
in  all  tumor  types  analysed  (Supplementary  Table  5). 
Amplicons  are  most  frequently  localized  to  lq21— 23, 
5pl5,  7p 1 3—1 1 ,  8q22-24,  llql3,  14ql2-21,  14q32, 
17q  12-21  and  20ql3.  A  total  of  538  unique  genes  were 
contained  within  the  hotspots  (Supplementary  Table  6) 
(see  Supplementary  Methods).  Interestingly,  the 
majority  of  these  hotspots  did  not  contain  the  292 
known  oncogenes. 


Table  1  Summary  and  distribution  of  amplicons  by  cancer  type 


Tissue  type 

Number 
of  Samples 

Total  number 
of  amplicons 

Amplicons/ 

Tumor 

Average  size 
(Mb) 

Most  frequent  amplification 1 

Lung 

53 

1690 

31.9 

0.68 

8q24.21  (28%) 

Breast 

17 

905 

53.2 

0.77 

8q24.21  (59%) 

Lymphoid 

9 

101 

11.2 

0.54 

9pl3.3,  13q31.3,  1 8q21 .33-22. 1  (33%) 

Cervix 

8 

45 

5.6 

0.65 

5pl 5.33  (38%) 

Skin 

4 

395 

98.8 

0.35 

7pl 3.3  &  7q35  (75%) 

Blood 

3 

23 

7.7 

0.80 

N/A 

Prostate 

3 

82 

27.3 

0.86 

14q21. 3-22.1  (67%) 

Bone 

2 

104 

52 

1.02 

N/A 

Colon 

2 

22 

11 

0.70 

N/A 

Ovary 

2 

3 

1.5 

0.33 

N/A 

Liver 

1 

61 

61 

0.83 

N/A 

Total 

104 

3431 

32.8 

0.68 

The  most  frequent  regions  were  determined  only  if  found  in  >  2  samples  for  the  corresponding  tissue  type. 
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Figure  1  Hotspots  of  amplification  in  cancer  genomes.  A  histogram  summarizing  the  regions  of  amplification  across  all  104  samples 
with  the  resulting  values  scaled  to  the  segment  with  the  highest  count  (28)  and  plotted  against  their  corresponding  genomic  position. 
Hotspots  are  denoted  by  the  dark  blue  shading,  whereas  the  light  blue  shading  represents  regions  amplified  ^5  times.  Triangles  mark 
common  fragile  sites.  Detailed  genomic  position  of  hotspots  and  common  fragile  sites  are  provided  in  Supplementary  Tables  5  and  17. 


There  was  no  association  between  amplification 
hotspots  and  known  fragile  sites  in  the  human  genome 
based  on  x2  test  at  the  chromosome  band  level,  even 
though  colocalization  does  exist,  for  example,  the  three 
fragile  sites  on  8q.  Figure  1  summaries  the  location  of 
the  86  common  fragile  sites  assayed  relative  to  hotspots 
of  amplification. 

Novel  amplification  hotspots  contain  putative  oncogenes 
Remarkably,  27  of  the  top  100  most  frequently 
amplified  genes  have  been  previously  described  to  be 
overexpressed  in  various  cancers  (Supplementary  Table 
7),  but  aside  from  MYC  and  ERRB2 ,  the  mechanism 
leading  to  overexpression  of  these  genes  was  largely 
unknown.  To  further  explore  the  properties  of  the 
amplified  genes,  functional  and  biological  characteristics 
were  evaluated  through  the  use  of  Ingenuity  Pathways 
Analysis  (Ingenuity  Systems,  see  (see  Supplementary 
Methods).  Functional  analysis  identified  a  significant 
association  between  the  amplified  genes  with  genes 
involved  in  cancer  (P  =  6.67E-06-3.03E-02;  the  two 
significance  values  refer  to  a  range  of  specific  sub- 
functions)  and  other  diseases  (Supplementary  Tables  8 
and  9).  Furthermore,  canonical  pathway  analysis  was 
used  to  determine  the  main  signaling  pathways  in 
which  the  amplified  genes  were  involved  (Supplementary 
Table  10).  Neuregulin  Signaling  (P  =  1.12E-02), 
also  known  as  EGFR-family-signaling,  was  the  most 
significantly  affected  with  GRB7 ,  SHC1 ,  SRC,  EGFR , 
ERBB2  and  AKT1  comprising  the  amplified  genes. 

Impact  of  amplification  on  gene  expression  levels 
To  understand  the  effects  of  amplification  on  gene 
regulation  and  transcription,  we  focused  on  one  type  of 
cancer.  Parallel  gene  expression  profiles  and  array 


comparative  genomic  hybridization  (CGH)  data  were 
integrated  for  27  NSCLC  cell  lines.  The  expression  levels 
for  genes  within  amplification  hotspots  (displayed  in 
Supplementary  Figure  2)  were  compared  between 
samples  with  amplification  and  those  with  neutral 
copy  number  status  using  the  Mann-Whitney  U  test 
(see  Supplementary  Methods).  In  total,  221  out  of  442  of 
the  amplified  genes  were  expressed  at  significantly  higher 
levels  (P<0.05)  with  increased  gene  dosage  (Figure  2 
and  Supplementary  Table  11).  For  the  majority  of  these 
genes,  amplification  is  a  novel  mechanism  for  activation, 
although  the  expression  levels  of  a  subset,  such  as  MYC, 
EGFR,  CDK4 ,  MAFB  and  MET,  are  known  to  be 
affected  by  increase  in  gene  dosage. 


Multiple  components  of  the  EGFR-family-signaling 
pathway  are  activated  by  DNA  amplification  in  NSCLC 
cell  lines  and  clinical  tumors 

To  relate  the  genes  activated  by  amplification  in  NSCLC 
to  biological  functions,  Functional  and  Canonical 
Pathway  Analysis  were  performed  using  IPA  software 
(Supplementary  Tables  12  and  13).  EGFR-family- 
signaling  was  the  most  affected  canonical  pathway 
(P  =  6.03E-03)  with  five  genes:  AKT1,  CDK5,  EGFR, 
MYC  and  SHC1  amplified  and  overexpressed  in  the  27 
NSCLCs  (Table  2,  Figure  2).  The  amplification  and 
subsequent  overexpression  of  AKT1,  CDK5  and  SHC1 
are  novel  findings  in  NSCLC.  Figure  3  displays  the 
interaction  of  these  genes  during  EGFR-family-signal¬ 
ing  and  the  resulting  downstream  effects  of  the 
activation  of  this  pathway,  which  includes  cell  prolifera¬ 
tion  and  survival.  Interestingly,  nearly  60%  of  the  cell 
lines  analysed  had  one  or  more  components  of  the 
EGFR  family  pathway  overexpressed  as  a  result  of 
amplification.  The  alteration  of  EGFR  and  MYC  alone 
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Table  2  Canonical  pathways  affected  by  amplification  in  non-small 
cell  lung  cancer 


Signaling  pathway  P -value 


Genes 


Neuregulin 


6.0E-03 


Huntington’s 

1.1E— 02 

disease 

VEGF 

3. IE— 02 

Insulin  receptor 

3.2E-02 

Nitric  oxide 

3.9E-02 

Hypoxia 

4.3E-02 

MYC,  CDK5,  SHC1,  EGFR 
and  AKT1 

SDH  A,  P0LR2H,  CDK5,  SHC1, 
P0LR2J,  EGFR  and  AKT1 
VEGF,  ARNT,  SHC1  and  AKT1 
PARD3,  PPP1R3D,  PPP1CA, 
SHC1  and  AKT1 
VEGF,  CALM1  and  AKT1 
VEGF,  ARNT  and  AKT1 


could  not  explain  pathway  disruption  in  all  cases  as 
~31%  (5/16)  of  samples  with  activated  downstream 
components  harbored  amplification  of  one  or  more  of 
CDK5 ,  SHC1  or  AKT1  independent  of  EGFR  and 
MYC.  Furthermore,  by  breaking  the  NSCLC  lines 
down  into  their  histological  subtypes,  it  was  discovered 
that  14  out  of  20  (70%)  adenocarcinoma  samples — 
whereas  no  squamous  and  only  one  large  cell  carcinoma 
samples — had  altered  components,  suggesting  that  the 
disruption  of  this  pathway  is  prevalent  in  the  adeno¬ 
carcinoma  subtype  of  lung  cancer. 

To  further  validate  our  results,  quantitative  real-time 
PCR  was  performed  on  select  genes,  AKT1 ,  CDK5  and 
SHC1.  First,  expression  levels  for  these  genes  were 
determined  using  the  A  A  Ct  method  and  compared 
between  cell  lines  and  normal  lung  tissue  to  confirm 
their  overexpression.  Relative  to  the  normal  lung 
reference,  AKT1  was  10.49-fold  overexpressed  in  sam¬ 
ples  with  gene  amplification  compared  to  1.94-fold 
overexpression  in  NSCLC  cells  with  neutral  copy 
number  status  for  this  gene  (Supplementary  Figure  3). 
Likewise,  CDK5  (Supplementary  Figure  4)  and  SHC1 
(Figure  4)  also  showed  higher  expression  with  increase 
gene  dosage,  suggesting  a  strong  correlation  of  gene 
dosage  and  transcription  levels  for  these  genes.  Expres¬ 
sion  changes  held  true  in  clinical  specimens  as  clinical 
adenocarcinoma  samples  frequently  showed  overexpres¬ 
sion  of  AKT1,  CDK5  and  SHC1  compared  to  their 
corresponding  matched  normal  lung  tissues  (Supple¬ 
mentary  Figure  3c,  4c  and  Figure  4c,  respectively).  Since 
these  genes  were  hypothesized  to  be  overexpressed  due 
to  DNA  amplification,  a  one-tailed  Wilcoxon  sign-rank 
test  was  used  to  determine  whether  overexpression  of 
these  genes  was  significant  in  the  set  of  matched 
tumor  and  normal  samples.  Indeed,  each  gene  was 
significantly  overexpressed  in  the  tumors  compared 
to  their  matched  normal  (P<0.05),  confirming  the 
results  from  the  cell  lines  (qPCR  data  is  provide  in 
Supplementary  Tables  14  and  15). 


Discussion 

Oncogene  activation  is  traditionally  associated 
with  translocation  events.  We  hypothesized  that 
DNA  amplification  is  a  prevalent,  but  underestimated, 
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Gene 

#  of  Samples  with  Amplification  and  Overexpression 

%  Samples  (n=27) 

EGFR 

5 

HI  81 9,  HCC2279,  H3255,  HCC4006,  HCC827 

18.5 

5HC1 

5 

HI  395,  HI 993,  H2122,  HCC1195,  HCC366 

18.5 

CDK5 

2 

H2009,  HCC193 

7.4 

AKT1 

3 

HCC366,  HI 81 9,  HCC461 

11.1 

c-  MYC 

8 

HI  395,  H2122,  HCC1195,  HCC2279,  HCC827,  H2087,  HI  975,  H460 

29.6 

*Total 

16 

59.3 

Figure  3  Frequent  amplification  and  overexpression  of  multiple  EGFR-family-signaling  components  in  non-small  cell  lung  cancer 
(NSCLC).  Diagram  highlighting  the  interaction  of  EGFR,  SHC1,  CDK5,  SHC1  and  MYC  in  the  EGFR  family-signaling  pathway. 
Altered  components  are  shaded  grey.  The  table  summarizes  the  number  and  specific  samples  with  amplification  and  overexpression  of 
each  pathway  component.  The  total  represents  the  number  of  samples  with  at  least  one  pathway  component  amplified  and 
overexpressed. 


mechanism  of  oncogene  activation  in  cancer  genomes. 
To  our  knowledge,  no  studies  to  date  have  assembled  a 
large  panel  of  paired  high-resolution  copy  number  and 
gene  expression  data  to  accurately  assess  this  question. 
In  this  study,  we  examined  104  cancer  cell  lines 
comprising  various  tissues  of  origin  (Table  1)  at  24  892 
autosomal  loci  per  genome,  a  resolution  that  detected 
amplicons  as  small  as  0.05  Mb  in  size  (Ishkanian  et  al ., 
2004)  and  discovered  that  not  only  is  the  incidence  of 
oncogene  amplification  much  greater  than  previously 
believed,  but  specific  regions  of  the  genome  are  hotspots 
for  segmental  amplification  in  cancer  cells. 

Amplification  as  a  major  mechanism  of  oncogene 
activation 

The  activation  of  oncogenes  is  a  hallmark  of  tumor 
development.  Cancer  cells  frequently  display  chromo¬ 
some  rearrangements  resulting  in  the  deregulation  of 


gene  expression,  as  well  as  in  the  fusion  of  genes  raising 
oncogenic  activity.  As  such,  the  majority  of  known 
oncogenes,  including  92%  of  those  analysed  in  this 
study,  have  been  discovered  through  their  involvement 
in  disease-specific  chromosomal  translocations  (Futreal 
et  al .,  2004).  Thus,  the  high  incidence  of  amplification 
we  report  suggests  that  oncogenes  may  have  multiple 
mechanisms  of  activation,  with  the  increase  in  gene 
dosage  being  a  prominent  mechanism  of  activation.  This 
was  particularly  evident  in  the  fact  that  genes,  which 
have  been  shown  to  be  activated  primarily  by  transloca¬ 
tion,  were  frequently  amplified  (Supplementary 
Table  3).  The  majority  of  these  genes  has  not  been 
shown  to  be  activated  by  amplification  previously,  and 
as  such,  this  data  represent  a  novel  finding.  For 
example,  t(14;20)(q32;ql2)  translocation  is  known  to 
juxtapose  IgH  enhancers  to  the  MAFB  gene  locus 
upregulating  its  expression  in  multiple  myeloma  (Wang 
et  al .,  1999;  Boersma-Vreugdenhil  et  al .,  2004).  We 
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Figure  4  SHC1  disruption  in  NSCLC  cell  lines  and  clinical  tumors,  (a)  Representative  array  CGH  profiles  for  samples  with  and 
without  SHC1  amplification.  Vertical  lines  denote  log2  signal  ratios  from  —1  to  1  with  copy  number  increases  to  the  right  (red  lines) 
and  decreases  to  the  left  (green  lines)  of  0  (purple  line),  with  amplified  region  shaded  orange.  Red  and  green  arrows  mark  clones  used  in 
subsequent  Fluorescence  in  situ  hybridization  (FISH)  analysis,  (b)  SHC1  expression  in  NSCLC  cell  lines.  The  normalized  fold  change 
of  expression  compared  to  a  normal  lung  reference  is  plotted  for  samples  with  amplification  (red)  and  those  with  neutral  copy  number 
status  (black),  (c)  Overexpression  of  SHC1  in  clinical  tumors.  The  log2  fold  change  in  expression  levels  of  SHC1  relative  to  their 
matched  normal  lung  tissue  is  plotted  for  each  tumor.  The  P-value  from  the  Wilcoxon  sign-rank  test  is  indicated,  (d  and  e)  FISH 
confirmation  of  SHC1  amplification  in  H1395.  FISH  was  performed  using  BAC  clones  mapping  to  SHC1  (RP11-624P9)  and  to  an 
adjacent  neutral  copy  number  region  (RP11-313J15). 


demonstrated  that  MAFB  amplification  also  occurs  in 
lung  (H1395,  H1650  and  H1666),  cervical  (SW756) 
and  liver  (HepG2)  cancer  cells.  Likewise,  NTRK1  is 
an  oncogene  frequently  activated  by  translocation 
(Roccato  et  al 2005).  Fusion  with  TPR,  TPM3  or 
TFG  results  in  constitutive  tyrosine  kinase  activity 
(Pierotti  et  al 1996).  Remarkably,  we  also  detected 
NTRK1  copy  number  increased  in  lung  (HCC366, 
HCC1833,  HCC1195,  H82,  H526,  H2122  and  H187) 
and  breast  (ZR7530)  cancer  genomes,  suggesting  that 
amplification  and  subsequent  overexpression  may  be  an 
alternate  mechanism  of  activation. 


Existence  of  an  amplifier  phenotype 
There  is  evidence  suggesting  that  some  cancer  cells  have 
a  greater  propensity  to  undergo  DNA  amplification 
than  others  and  that  there  is  an  underlying  genetic  basis 
for  this  ‘amplifier  phenotype’  (Albertson,  2006).  Our 
data  showed  that  the  number  of  oncogenes  amplified 
may  differ  in  individual  genomes.  The  variation  existed 
across  both  general  cancer  classes  and  individual  tissue 
types.  In  addition,  amplification  of  the  same  genes  in  the 
different  tumor  types  suggests  that  there  may  be  a 
selective  advantage  to  have  certain  genes  or  their  related 
functions  elevated  in  the  context  of  cancer  development 
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and  that  amplifications  are  not  simply  byproducts  of 
general  genomic  instability  characteristic  of  late-stage 
tumors.  It  was  also  common  for  a  sample  to  simulta¬ 
neously  harbor  multiple  amplicons  on  different  chromo¬ 
somes,  highlighting  the  possibility  of  an  underlying 
genetic  basis  for  amplification  development  (Supple¬ 
mentary  Figure  1).  It  has  been  proposed  that  amplifica¬ 
tions  are  mainly  related  to  solid  tumors  and  are  seldom 
involved  in  hematological  malignancies  in  which 
oncogene  activation  is  generally  associated  with  trans¬ 
locations  (Mitelman,  2000).  Indeed,  only  11  known 
oncogenes  were  amplified  in  leukemia  or  lymphoma 
genomes  in  our  data  set,  whereas  144  were  amplified  in 
epithelial  cases  (Supplementary  Table  2).  Similar  results 
were  observed  in  the  number  of  amplicons  in  each 
genome  (~10  per  hematological,  ~36  in  epithelial 
samples).  These  results  suggest  that  subsets  of  cancers, 
such  as  the  epithelial  cancers,  are  driven  by  an  amplifier 
phenotype,  whereas  others,  typically  hematological 
malignancies,  develop  mainly  through  different  genetic 
mechanisms,  such  as  chromosomal  translocation. 


Novel  amplification  hotspots  are  enriched  for  putative 
oncogenes 

Figure  1  indicates  that  regions  of  frequent  copy  number 
amplification  are  preferentially  localized  in  genome. 
These  results  are  similar  to  those  found  in  a  bibliomics 
survey,  which  looked  across  73  different  neoplasms 
using  conventional  CGH  (Myllykangas  et  al.,  2006). 
However,  in  this  study,  we  further  refined  the  regions 
beyond  the  chromosome  band  level,  determining  the 
exact  genes  affected  by  these  aberrations  (Supplemen¬ 
tary  Table  6),  identifying  novel  hotspots,  such  as  the 
discrete  amplicons  on  14q. 

There  are  two  potential  factors  that  may  determine 
the  localization  of  amplification  hotspots.  First,  the 
selective  pressure  imposed  on  the  tumor  may  lead  to  the 
selection  of  amplification  of  regions  containing  genes 
advantageous  to  tumor  growth.  Consistent  with  this 
theory,  we  observed  a  significant  enrichment  for  genes 
involved  in  cellular  functions  and  canonical  pathways 
commonly  involved  in  tumorigenesis  within  the  ampli¬ 
fication  hotspots  (Supplementary  Tables  8-10).  Many 
genes  implicated  in  key  biological  processes,  such  as  cell 
cycle  and  cell  growth  and  proliferation  contained  within 
these  regions  that  may  be  considered  as  novel  candidate 
oncogenes.  In  addition,  27  of  the  top  100  genes  within 
the  hotspots  have  been  previously  described  to  be 
overexpressed  in  cancer  further  supporting  their  onco¬ 
genic  role  (Supplementary  Table  7).  Since  the  mechan¬ 
ism  leading  to  the  overexpression  of  the  majority  of 
these  genes  was  previously  unknown,  our  data  suggest 
that  amplification  may  be  a  key  mechanism  of  their 
activation. 

Second,  the  intrinsic  features  of  the  chromosome 
regions  themselves  may  be  involved  in  their  preferential 
amplification  (Wahl  et  al .,  1984).  Mechanistic  models, 
such  as  breakage-fusion-bridge  and  episome  excision, 
imply  that  two  double-stranded  DNA  breaks  are 
required  to  initiate  amplification  generation  (Myllykangas 
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and  Knuutila,  2006).  As  such,  it  has  been  proposed  that 
regions  of  the  genome  that  are  more  susceptible  to 
breakage  have  a  greater  propensity  to  undergo  ampli¬ 
fication,  such  as,  fragile  sites  (Heilman  et  al .,  2002; 
Buttel  et  al .,  2004).  Conventional  CGH  studies  indicated 
that  many  amplification  hotspots  colocalized  with 
fragile  sites,  however,  this  association  was  not  statisti¬ 
cally  significant  on  a  genome-wide  scale,  presumably 
due  to  inadequate  resolution  (with  chromosome  band 
level  comparison)  as  fragile  sites  and  amplification 
hotspots  covered  30  and  45%  of  the  genome,  respec¬ 
tively  (Myllykangas  et  al .,  2006).  Our  analysis  addressed 
this  problem  as  the  increased  resolution  of  the  platform 
used  in  this  study  allowed  the  refinement  of  amplifica¬ 
tion  hotspots,  limiting  their  coverage  to  ~3%  of  the 
genome.  However,  although  we  observed  a  general  trend 
of  the  colocalization  of  amplification  hotspots  and 
common  fragile  sites  (Figure  1),  the  global  association 
was  still  not  significant.  We  speculate  that  the  cloning  of 
fragile  sites  to  determine  their  specific  sequences  is 
needed  to  complement  our  array  CGH  data  to 
accurately  assess  their  association.  Nevertheless,  there 
is  a  strong  possibility  that  the  hotspot  regions  represent 
damage-prone  sites  in  the  genome  and  further  investiga¬ 
tion  in  the  future  is  warranted.  Notably,  in  addition  to 
fragile  sites,  other  genomic  features,  such  as  copy 
number  variations  and  segmental  duplications,  may 
also  contribute  to  DNA  rearrangement  in  cancer  cells 
(Squire  et  al .,  2003). 
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Global  impact  of  amplification  on  gene  expression  levels 
in  NSCLC 

Although  array  CGH  allows  the  fine  mapping  of 
amplification  boundaries  at  unprecedented  resolution, 
multiple  genes  may  map  to  an  individual  amplicon. 
Therefore,  integration  of  copy  number  and  expression 
data  is  needed  to  distinguish  overexpressed  genes  from 
bystander  genes  within  amplicons.  To  accomplish  this, 
we  integrated  parallel  array  CGH  and  expression  data 
for  a  subset  of  27  NSCLC  cell  lines.  Aside  from  genes 
known  to  be  activated  by  amplification  in  NSCLC, 
such  as  EGFR  and  MYC ,  our  data  suggest  that  the 
expression  of  oncogenes,  including  CDK4 ,  MAFB, 
BCF11B  and  MET ,  is  also  driven  by  amplification 
(Figure  2).  Since  these  genes  are  typically  activated  by 
translocation  or  missense  mutation  in  malignancies 
other  than  NSCLC,  these  data  further  supports  our 
hypothesis  that  amplification  is  an  alternate  mechanism 
of  oncogene  activation  in  a  subset  of  cancers. 

The  integration  of  the  data  sets  identified  expressed 
genes  within  novel  amplification  hotspots  in  lung  cancer 
that  are  potentially  involved  in  tumorigenesis.  For 
example,  thyroid  transcription  factor  1  ( TITF1 )  in  the 
novel  hotspot  at  14ql2-ql3  is  known  to  be  over¬ 
expressed  specifically  in  lung  adenocarcinoma,  the 
predominate  subtype  of  NSCLC  from  which  the  27  cell 
lines  in  our  study  were  derived  (Fabbro  et  al .,  1996). 
This  gene  encodes  a  homeodomain  transcription  factor 
that  is  involved  in  regulating  pulmonary  development 
and  gene  expression  (Apergis  et  al .,  1998)  and  has  been 
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proposed  to  be  a  lineage  marker  for  tumors  arising  from 
the  peripheral  airway  (Stenhouse  et  al .,  2004).  Adeno¬ 
carcinomas  that  express  TITF1  are  dependent  on  its 
persistent  expression  for  survival  (Tanaka  et  al .,  2007). 
Acquired  somatic  alteration  of  this  gene  during  differ¬ 
entiation  leads  to  aberrant  lineage-survival  pathway 
signaling.  The  resulting  tumors  become  addicted  to 
persistent  expression  of  the  gene  for  survival,  known  as 
‘lineage  addiction’  (Garraway  and  Sellers,  2006).  Our 
data  suggests  that  amplification  may  be  a  mechanism 
driving  the  expression  of  linage- survival  oncogenes  in  a 
subset  of  cancers  and  further  supports  a  role  for  TITF1 
in  lung  adenocarcinoma  tumorigenesis  (Figure  2  and 
Supplementary  Figure  5). 

On  a  global  scale,  we  found  amplification  had  a 
strong  impact  on  transcription  levels  as  50%  of  the 
genes  from  amplification  hotspots  within  these  samples 
showed  enhanced  expression  as  a  consequence  of 
alteration.  This  falls  within  observations  of  previous 
studies  that  reported  19.3-62%  of  amplified  genes  being 
overexpressed  (Hyman  et  al .,  2002;  Pollack  et  al .,  2002; 
Wolf  et  al .,  2004;  Heidenblad  et  al .,  2005).  Nevertheless, 
since  this  is  the  first  study  to  integrate  high-resolution 
copy  number  and  gene  expression  profiles  in  lung  cancer 
on  a  whole-genome  scale,  future  analysis  will  be  needed 
to  confirm  the  functional  impact  of  the  genes  in  each 
amplicon. 

Novel  disruptions  of  the  EG FR-family- signaling  pathway 
in  NSCLC  by  gene  amplification 

Our  data  indicate  that  multiple  components  of  the 
EGFR-family-signaling  pathway  are  frequently  ampli¬ 
fied  and  overexpressed  in  NSCLC  (Figure  3).  Deregula¬ 
tion  of  this  network  commonly  occurs  in  cancer  and 
specifically  NSCLC.  In  NSCLC,  the  mechanism  of 
deregulation  is  usually  attributed  to  receptor  over¬ 
expression  or  point  mutations  in  the  catalytic  domain, 
resulting  in  ligand-independent  constitutive  receptor 
activation  and  signaling  (Bublil  and  Yarden,  2007). 
However,  in  cases  with  normal  levels  of  wild-type 
receptor,  aberrant  constitutive  signaling  may  also  occur. 
Strikingly,  our  data  suggest  that  amplification  of 
downstream  signaling  components  may  be  an  alternate 
mechanism  of  pathway  activation  in  a  subset  of  tumors. 
Although  alteration  of  this  pathway  was  detected  in 
59%  of  the  cell  lines  analysed,  only  31%  of  these  cases 
could  be  explained  by  EGFR  activation.  Indeed,  the 
majority  of  cell  lines  with  pathway  perturbation  (69%) 
contained  amplification  of  key  signaling  components 
downstream  of  the  receptor  level  (Figure  3).  The 
overexpression  of  these  components  was  confirmed  in 
NSCLC  tumors,  highlighting  their  clinical  significance 
(Figure  4  and  Supplementary  Figures  3  and  4). 
Although  MYC  amplification  has  been  previously 
reported,  this  is  the  first  study  to  describe  the  frequent 
amplification  and  overexpression  of  SFfCl,  AKT1  and 
CDK5  in  NSCLC.  Since  these  genes  are  involved  in  the 
activation  of  mitogenic- signaling  pathways  involved  in 
EGFR-induced  transformation  and  their  overexpression 
has  been  previously  implicated  in  cancer,  these  results 
suggest  that  their  direct  genetic  activation  may  play  a 


causal  role  in  NSCLC  tumorigenesis  and  highlights  the 
impact  of  these  novel  amplifications  on  cancer  biology 
(Hennessy  et  al .,  2005). 

The  direct  amplification  of  downstream  components 
may  also  have  substantial  effect  on  the  response  to 
clinical  treatment  strategies.  The  high  frequency  of  EGFR 
family  overexpression  in  cancer  has  led  to  the  develop¬ 
ment  of  targeted  therapeutics  aimed  at  inhibiting  receptor 
function.  For  example,  anti-ErbB2  antibodies  are  cur¬ 
rently  used  for  breast  cancer  treatment  and  EGFR- 
specific  tyrosine  kinase  inhibitors,  such  as  Gefitinib  and 
Erlotinib  are  used  in  NSCLC  therapy  (Bublil  and 
Yarden,  2007).  The  receptor-independent  activation  of 
downstream  signaling  components  would  impact  the 
effectiveness  of  these  treatment  strategies  as  constitutive 
activation  of  signaling  pathways  would  occur  regardless 
of  receptor  inhibition.  Previous  studies  have  shown  that 
downregulation  of  the  AKT/PI3K  signaling  pathway  is 
required  for  EGFR  tyrosine  kinase  inhibitors  to  induce 
apoptosis  in  cancer  cells  (Hemstrom  et  al.,  2006).  In 
addition,  activated  AKT/PI3K  signaling  due  to  MET 
amplification  has  been  shown  to  lead  to  Gefitinib 
resistance  in  NSCLC  cells  (Engelman  et  al.,  2007)  Thus, 
the  direct  activation  of  AKT1  and  CDK5  would  lead  to 
resistance  in  a  NSCLC  due  to  maintained  AKT/PI3K 
signaling  in  the  presence  of  inhibitor.  Likewise,  in  a  drug- 
resistant  NSCLC  cell  line,  alterations  of  adaptor-protein- 
mediated  signal  transduction  from  EGFR,  such  as  those 
initiated  by  SHC1,  has  been  proposed  as  a  possible 
mechanism  of  resistance  to  Gefitinib  (Koizumi  et  al., 
2005).  Our  findings  highlight  the  need  to  assess  the 
activation  status  of  downstream  signaling  components 
and  suggest  that  amplification  AKT1,  SHC1,  CDK5  and 
MYC  may  be  used  for  this  process. 

Conclusion 

Since  alterations  at  the  DNA  level  potentially  represent 
causal  events  in  the  development  of  cancer,  the  genes 
deregulated  as  a  result  of  amplification  in  NSCLC  can 
be  viewed  as  the  primary  oncogenic  targets  in  a  tumor 
that  lead  to  downstream  pathway  abrogation.  The  gain- 
of-function  effect  of  gene  amplifications  makes  them 
ideal  targets  for  therapeutic  intervention  due  to  the 
direct  nature  of  their  activation  and  the  fact  that  a 
tumor  can  become  addicted  to  their  enhanced  expres¬ 
sion  (Weinstein,  2002).  Although  cancer  cell  lines  were 
used  in  this  study,  the  genomic  and  transcriptional 
characteristics  of  such  models  have  been  shown  to 
mirror  primary  tumors  and  are  appropriate  systems  to 
identify  molecular  features  that  predict  or  indicate 
response  to  targeted  therapies  (Neve  et  al.,  2006; 
Greshock  et  al.,  2007).  Furthermore,  validation  of  a 
subset  of  amplified  genes  in  primary  tumors  confirmed 
their  clinical  importance.  Future  studies  of  additional 
primary  tumors  will  be  required  to  further  validate  the 
role  of  the  hotspots  in  clinical  specimens  and  confirm 
that  they  are  not  artifacts  of  in  vitro  culture.  Our 
discovery  of  high  incidence  of  amplification  suggests 
that  it  is  a  major  mechanism  of  oncogene  activation  in 
cancer  and  will  provide  essential  starting  points  for  the 
discovery  of  novel  oncogenes. 
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Materials  and  methods 

Whole-genome  profiling 

DNA  copy  number  profiles  for  104  cancer  cell  lines  of  lung, 
breast,  prostate,  cervical,  skin,  ovarian,  liver  and  hematologi¬ 
cal  origins  were  used  in  this  study  (Supplementary  Table  16). 
DNA  was  isolated  by  proteinase  K  digestion  followed  by 
phenol-chloroform  extraction.  Array  hybridization  was  per¬ 
formed  as  previously  described  (Lockwood  et  al .,  2007),  using 
SMRT  array  v.2  (Ishkanian  et  al .,  2004;  Watson  et  al .,  2007). 
Array  images  were  analysed  using  SoftWoRx  Tracker  Spot 
Analysis  software  (Applied  Precision,  Issaquah,  WA,  USA). 
Systematic  biases  were  removed  using  the  stepwise  normal¬ 
ization  procedure  CGH  Norm  (Khojasteh  et  al .,  2005).  SeeGH 
software  allowed  visualization  of  log2  ratio  plots  in  karyo- 
grams  (Chi  et  al .,  2004).  All  raw  array  data  files  have  been 
made  publicly  available  through  the  System  for  Integrative 
Genomic  Microarray  Analysis  (SIGMA),  which  can  be 
accessed  at  http://sigma.bccrc.ca  (Chari  et  al .,  2006). 

Gene  expression  profiling 

RNA  samples  from  27  NSCLC  cell  lines  and  normal  human 
bronchial  epithelial  cells  were  analysed  using  the  Affymetrix 
Gene  Chips  HG-U133A  and  HG-U133B  (Henderson  et  al , 
2005;  Zhou  et  al ,  2006)  (Supplementary  Table  16).  These 
arrays  together  represent  23  583  unique  genes  based  on 
Unigene  build  173.  The  identity  of  these  cell  lines  has  been 
verified  by  DNA  fingerprint,  using  the  Powerplex  1.2  system 
(Promega).  Data  normalization  and  microarray  analysis  was 
performed  using  Affymetrix  Microarray  Suite  5.0  as  described 
previously(Zhou  et  al ,  2006).  The  microarray  data  have  been 
uploaded  to  GEO  (Gene  Expression  Omnibus,  accession 
number  GSE-4824). 


Statistical  analysis  of  array  data 

Detailed  methods  describing  the  statistical  analysis  used  for 
the  identification  of  amplicons,  amplification  hotspots,  ampli¬ 
fication  hotspots  and  fragile  site  colocalization,  functional 
assessment  of  amplified  genes  and  integration  of  genomic,  and 
gene  expression  data  are  provided  in  Supplementary  Methods. 
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Gene-specific  quantitative  real-time  reverse  transcriptase  PCR 
analysis 

TaqMan  gene  expression  assays  (AKT1  (Hs00178289_ml), 
SHC1  (Hs00427539_ml),  CDK5  (Hs00358991_gl)  and  18  s 
rRNA  (Hs99999901_sl)  were  performed  using  100  ng  of  cDNA 
samples  in  a  7500  Fast  Real-Time  PCR  System  (Applied 
Bio  systems,  Foster  City,  CA,  USA).  The  A  A  Ct  method  was 
used  for  expression  quantification  using  the  average  cycle 
threshold  of  18S  rRNA  for  normalization  (Coe  et  al ,  2006)  and 
human  lung  total  RNA  (AM7968,  Ambion,  Austin,  TX,  USA) 
as  a  reference.  For  clinical  samples,  total  RNA  was  isolated 
from  10  microdissected  frozen  lung  adenocarcinoma  and 
matched  normal  tissue  obtained  from  Vancouver  General 
Hospital  using  RNeasy  Mini  Kits  (QIAGEN  Inc.,  Mississauga, 
ON,  USA)  and  1  pg  was  converted  to  cDNA  for  gene-specific 
quantitative  PCR  for  AKTf  SHC1 ,  CDK5  and  18S  rRNA. 
Cycle  thresholds  comparison  yielded  expression  changes  in  the 
tumors.  Because  these  genes  were  hypothesized  to  be  over¬ 
expressed,  owing  to  DNA  amplification,  a  one-tailed  Wilcoxon 
sign-rank  test  was  used  to  determine  whether  overexpression 
was  significant  in  the  set  of  matched  tumor  and  normal  samples. 

Fluorescence  in  situ  hybridization 

Fluorescence  in  situ  hybridization  was  performed  as  described 
previously  (Watson  et  al ,  2004).  Briefly,  100  ng  of  linker- 
mediated  PCR-amplified  BAC  DNA  was  labeled  through  a 
random  priming  reaction  with  Spectrum  Green  or  Red  dUTP 
(Vysis,  Markham,  ON,  USA).  Hybridization  was  performed  in 
a  50%  formamide  buffer  at  37  °C  for  18  h  and  imaged  with  Q 
Capture  imaging  software  (Q  Imaging,  Burnaby,  BC,  USA). 
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Loss  and  Reduction  of  Fusl  Protein  Expression  is  a  Frequent 
Phenomenon  in  the  Pathogenesis  of  Lung  Cancer 
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Ludmila  Prudkin,  Carmen  Behrens,  Diane  D.  Liu,  Xian  Zhou,  Natalie  C.  Ozburn,  B.  Nebiyou  Bekele, 
John  D.  Minna,5,6,7  Cesar  Moran,1,2  Jack  A.  Roth,4  Lin  Ji,4and  Ignacio  I.  Wistuba1,2 


Abstract  Purpose:  FUS1,  a  novel  tumor-suppressor  gene  located  in  the  chromosome  3p21.3  region,  may 
play  an  important  role  in  lung  cancer  development.  Currently,  FUS1- expressing  nanoparticles 
have  been  developed  for  treating  patients  with  lung  cancer.  However,  the  expression  of  Fusl 
protein  has  not  been  examined  in  a  large  series  of  lung  cancers  and  their  sequential  preneoplastic 
lesions. 

Experimental  Design:  Using  tissue  microarrays,  we  examined  Fusl  immunohistochemical 
expression  in  281  non -small  cell  lung  carcinoma  (NSCLC)  and  22  small  cell  lung  carcinoma 
tissue  specimens  and  correlated  the  findings  with  patients'  clinicopathologic  features.  To  inves¬ 
tigate  the  expression  of  Fusl  in  the  early  sequential  pathogenesis  of  NSCLC,  we  studied  Fusl 
expression  in  211  histologically  normal  and  mildly  abnormal  bronchial  epithelia,  and  118  bronchial 
and  alveolar  preneoplastic  lesions  obtained  from  patients  with  lung  cancer. 

Results:  Loss  and  reduction  of  expression  was  detected  in  82%  of  NSCLCs  and  100%  of 
small  cell  lung  carcinomas.  In  NSCLCs,  loss  of  Fusl  immunohistochemical  expression  was 
associated  with  significantly  worse  overall  survival.  Bronchial  squamous  metaplastic  and 
dysplastic  lesions  expressed  significantly  lower  levels  of  Fusl  compared  with  normal  (P  =  0.014 
and  0.047,  respectively)  and  hyperplastic  (P  =  0.013  and  0.028,  respectively)  epithelia. 
Conclusions:  Our  findings  show  a  high  frequency  of  Fusl  protein  loss  and  reduction  of  expres¬ 
sion  in  lung  cancer,  and  suggests  that  this  reduction  may  play  an  important  role  in  the  early 
pathogenesis  of  lung  squamous  cell  carcinoma.  These  findings  support  the  concept  that  FUS1 
gene  and  Fusl  protein  abnormalities  could  be  used  to  develop  new  strategies  for  molecular  cancer 
therapy  for  a  significant  subset  of  lung  tumors. 


Lung  cancer  is  the  leading  cause  of  cancer-related  deaths  in  the 
United  States  (1).  Lung  cancer  consists  of  several  histologic 
types  (2),  the  most  frequent  being  small  cell  lung  carcinoma 
(SCLC)  and  two  types  of  non -small  cell  lung  carcinoma 
(NSCLC),  adenocarcinoma,  and  squamous  cell  carcinoma  (3). 
In  spite  of  advances,  the  underlying  processes  involved  in  the 
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early  pathogenesis  of  lung  cancer  remain  unclear.  NSCLCs  are 
believed  to  arise  after  the  progression  of  sequential  preneo¬ 
plastic  lesions,  including  bronchial  squamous  dysplasias  for 
squamous  cell  carcinoma  and  atypical  adenomatous  hyper¬ 
plasias  (AAH)  for  a  subset  of  adenocarcinomas  (4).  An  in¬ 
creased  understanding  of  the  molecular  mechanisms  involved 
in  the  pathogenesis  and  progression  of  lung  cancer  may  lead  to 
new  and  more  effective  strategies  for  early  detection  and 
targeted  chemoprevention  and  treatment. 

Tumor-suppressor  genes  play  a  major  role  in  the  pathogen¬ 
esis  of  human  lung  and  other  cancers  (2).  Lung  cancer  cells 
harbor  mutations  and  deletions  in  multiple  known  tumor- 
suppressor  genes;  however,  genetic  alterations  and  allelic  losses 
(loss  of  heterozygosity)  on  the  short  arm  of  chromosome  3  sites 
(3p25-26,  3p21.3-22,  3pl4,  and  3pl2)  are  among  the  most 
frequent  and  earliest  molecular  abnormalities  detected  in  the 
pathogenesis  of  lung  cancer  (5,  6).  In  particular,  chromosomal 
abnormalities  at  the  3p21.3  region  and  expressional  deficien¬ 
cies  in  3p21.3  genes  are  frequently  found  in  lung  cancer  (7).  In 
addition,  3p21.3  allelic  losses  have  been  frequently  detected  in 
histologically  normal  bronchial  epithelia  and  preneoplastic 
lesions  in  lung  cancer  patients  and  smokers  (6,  8). 

The  novel  FUS1  gene  is  one  of  the  candidate  tumor- suppressor 
genes  that  have  been  identified  in  a  120-kb  homozygous 
deletion  region  in  human  chromosome  3p21.3  (5,  9-11). 
Genomic  alterations  of  the  FUS1  gene  and  resultant  loss 
of  expression  or  deficiency  of  posttranslational  modification  of 
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the  Fusl  protein  have  been  found  in  a  majority  of  NSCLC  cell 
lines  and  in  almost  all  SCLCs  (9-11).  Recently,  it  was  reported 
that  Fusl  is  a  myristoylated  protein  and  that  myristoylation  in  its 
NH2  terminus  is  required  for  FUS1  -mediated  tumor  suppression 
activity  (9).  Immunohistochemical  Fusl  expression  examination 
of  20  NSCLC  tissue  specimens  showed  loss  of  protein  in  15  of  20 
(75%)  cases,  and  these  findings  were  confirmed  by  mass 
spectrometric  analysis  (11).  To  translate  these  findings  into 
clinical  applications  for  molecular  cancer  therapy,  a  novel  FUS1  - 
expressing  nanoparticle  has  been  developed  for  treating  patients 
with  lung  cancer  (12),  suggesting  that  FUS1  gene  and  protein 
abnormalities  could  be  used  to  develop  new  strategies  for 
molecular  cancer  therapy.  To  date,  however,  the  expression  of 
Fusl  has  not  been  studied  comprehensively  in  lung  cancer 
tumors  and  lung  preneoplastic  lesion  tissues. 

To  better  understand  the  importance  of  Fusl  expression  in 
lung  cancer  pathogenesis  and  progression,  we  investigated 
Fusl  immunohistochemical  expression  in  a  large  series  of 
NSCLC  and  SCLC  tumor  tissue  specimens  and  adjacent  lung 
bronchial  and  alveolar  epithelial  foci  using  tissue  microarray 
specimens,  and  we  correlated  those  findings  with  the  clinico- 
pathologic  features  of  patients  with  lung  cancer. 

Materials  and  Methods 

Case  selection  and  tissue  microarray  construction.  We  obtained 
archival,  formalin-fixed  and  paraffin-embedded  material  from  surgical¬ 
ly  resected  lung  cancer  specimens  containing  tumor  and  adjacent  lung 
tissues  from  the  Lung  Cancer  Specialized  Program  of  Research 
Excellence  Tissue  Bank  at  The  University  of  Texas  M.D.  Anderson 
Cancer  Center  (Houston,  TX)  from  1997  to  2001.  This  study  was 
approved  by  the  Institutional  Review  Board.  Tumor  tissue  specimens 
from  303  lung  cancers  (22  SCLCs,  172  adenocarcinomas,  and  109 
squamous  cell  carcinomas)  were  histologically  examined,  classified 
using  the  2004  WHO  classification  system  (3),  and  selected  for  tissue 
microarray  construction.  After  histologic  examination,  the  tissue 
microarrays  were  constmcted  using  triplicate  1  mm  diameter  cores 
from  each  tumor. 

Detailed  clinical  and  pathologic  information,  including  demographic 
data,  smoking  history  (never-  and  ever-smokers)  and  status  (never, 


former,  and  current  smokers),  pathologic  tumor-node-metastasis 
staging  (13),  overall  survival,  and  time  of  recurrence,  was  available  in 
most  cases  (Table  1).  Patients  who  had  smoked  at  least  100  cigarettes  in 
their  lifetime  were  defined  as  smokers,  and  smokers  who  quit  smoking 
at  least  12  months  before  lung  cancer  diagnosis  were  defined  as  former 
smokers. 

To  assess  Fusl  immunohistochemical  expression  in  the  pathogenesis 
of  NSCLC  from  the  surgically  resected  formalin-fixed  and  paraffin- 
embedded  specimens,  we  included  329  bronchial  epithelium  speci¬ 
mens  which  had  normal  histology  (n  =  68),  basal  cell  hyperplasia 
(n  =  120),  squamous  metaplasia  ( n  =  23),  squamous  dysplasia  (n  =  62), 
and  AAH  lesions  (n  =  56)  for  analysis  (Table  1).  Histologic  classifica¬ 
tion  of  epithelial  lesions  was  done  using  the  2004  WHO  classification 
system  for  lung  preneoplastic  lesions  (3).  For  Fusl  expression  analysis, 
squamous  dysplasias  were  arranged  into  two  groups:  (a)  low-grade, 
mild  and  moderate  dysplasias  (n  =  14);  and  (b)  high-grade,  severe 
dysplasia  and  carcinoma  in  situ  (n  =  48). 

Immunohistochemical  staining  and  evaluation.  The  rabbit  anti-Fusl 
polyclonal  antibody  used  for  immunohistochemical  staining  was  raised 
against  a  synthetic  oligopeptide  derived  from  NH2-terminal  amino  acid 
sequence  (NH2-GASGSKARGFWPFAAC;  ref.  11).  Formalin-fixed  and 
paraffin-embedded  tissue  histology  sections  (5  pm  thick)  were 
deparaffinized,  hydrated,  and  heated  in  a  steamer  for  10  min  with 
10  mmol/F  of  sodium  citrate  (pH  6.0)  for  antigen  retrieval.  Peroxide 
blocking  was  done  with  3%  H202  in  methanol  at  room  temperature  for 
15  min,  followed  by  10%  bovine  serum  albumin  in  TBS-t  for  30  min. 
The  slides  were  incubated  with  primary  antibody  at  1:400  dilution  for 
65  min  at  room  temperature.  After  washing  with  PBS,  incubation  with 
biotin-labeled  secondary  antibody  for  30  min  followed.  Finally,  the 
samples  were  incubated  with  a  1:40  solution  of  streptavidin-peroxidase 
for  30  min.  The  staining  was  then  developed  with  0.05%  3', 3- 
diaminobenzidine  tetrahydrochloride  prepared  in  0.05  mol/F  of  Tris 
buffer  at  pH  7.6  containing  0.024%  H202  and  then  counterstained  with 
hematoxylin.  Formalin-fixed  and  paraffin-embedded  lung  tissues  with 
normal  bronchial  epithelia  were  used  as  a  positive  control.  For  a 
negative  control,  we  used  the  same  specimens  used  for  the  positive 
controls,  replacing  the  primary  antibody  with  PBS. 

Fusl  immunostaining  was  detected  in  the  cytoplasm  of  epithelial 
and  tumor  cells  (Fig.  1).  Immunohistochemical  expression  was 
quantified  by  two  independent  pathologists  (F.  Prudkin  and  I.I. 
Wistuba)  using  a  four-value  intensity  score  (0,  1+,  2+,  and  3+)  and 
the  percentage  of  the  reactivity  extent.  A  consensus  value  on  both 
intensity  and  extension  was  reached  by  the  two  independent  observers. 


Table  1.  Demographic  and  clinicopathologic  data  regarding  the  lung  cancer  and  respiratory  epithelial  samples 
studied  for  Fusl  immunohistochemical  expression 


Type  and  histology  of  samples  No. 

Sex 

Stage* 

Smoking  history f 

Smoking  status 1 

F 

M 

I 

II 

Ill 

IV 

Yes 

No 

Never 

Former 

Current 

Total  cancers 

303 

155 

148 

— 

— 

— 

— 

— 

— 

— 

— 

— 

SCLC 

22 

7 

15 

— 

— 

— 

— 

22 

0 

0 

3 

7 

NSCLC 

281 

148 

133 

178 

57 

38 

8 

194 

80 

80 

117 

73 

Adenocarcinoma 

172 

105 

67 

118 

22 

27 

5 

108 

61 

61 

63 

41 

Squamous  cell  carcinoma 

109 

43 

66 

60 

35 

11 

3 

86 

19 

19 

54 

32 

Total  epithelial  foci 

329 

136 

193 

— 

— 

— 

— 

274 

50 

50 

120 

145 

Normal  epithelium 

68 

35 

33 

— 

— 

— 

— 

53 

15 

15 

26 

27 

Hyperplasia 

120 

31 

89 

— 

— 

— 

— 

107 

11 

11 

50 

57 

Squamous  metaplasia 

23 

10 

13 

— 

— 

— 

— 

18 

2 

2 

9 

9 

Low  grade  dysplasia 

14 

6 

8 

— 

— 

— 

— 

14 

0 

0 

8 

6 

High  grade  dysplasia 

48 

12 

36 

— 

— 

— 

— 

44 

4 

4 

11 

24 

AAH 

56 

42 

14 

— 

— 

— 

— 

38 

18 

18 

16 

22 

^Staging  is  shown  only  for  NSCLC  cases.  All  SCLCs  were  limited  stage. 
f  Patient  smoking  history  was  not  available  for  seven  NSCLC  and  five  epithelial  specimens. 

*  Patient  smoking  status  was  not  available  for  12  SCLC,  11  NSCLC,  and  14  epithelial  specimens. 
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SCLC 


Squamous  Cell  Carcinoma 


Adenocarcinoma 


Fig.  1.  Representative  examples  of  Fusl  immunohistochemical  staining  of  lung  cancer  specimens.  SCLC  with  reduced  (A)  and  negative  ( B )  Fusl  expression.  Squamous  cell 
carcinoma  with  high  (C)  and  negative  (D)  Fusl  expression.  Adenocarcinoma  with  high  (£)  and  negative  (F)  Fusl  expression.  A  to  F  original  magnification, 
x400  {pictures ),  and  x40  {insets). 


Correlation  analyses  were  done  between  the  three  quantifications  and 
all  showed  statistical  significance  (r  =  0.75;  P  <  0.001).  A  final  con¬ 
sensual  score  was  obtained  by  multiplying  both  intensity  and  extension 
values  (range,  0  -  300),  and  four  levels  of  staining  were  calculated  based 
on  that  score:  ( a )  negative  (score  0),  (b)  low  (score  <100),  (c) 
intermediate  (score  >100  to  <200),  and  ( d )  and  high  (score  >200) 
expressions.  On  the  basis  of  the  high  level  of  expression  detected  in 
normal  bronchial  epithelium  specimens,  high  score  levels  were  defined 
as  preserved  staining  pattern,  whereas  intermediate  and  low  score  levels 


were  defined  as  reduced  staining  pattern,  and  negative  score  as  loss  of 
expression.  Levels  and  scores  were  used  for  analysis. 

Statistical  analysis.  The  data  were  summarized  using  standard 
descriptive  statistics  and  frequency  tabulations.  Associations  between 
categorical  variables  were  assessed  via  cross-tabulation,  x2  test  and 
Fisher's  exact  test.  Wilcoxon  rank  sum  test  and  Kruskal-Wallis  test  were 
done  to  assess  the  differences  between  patients'  clinicopathologic 
groups  with  respect  to  continuous  variables.  Survival  curves  were 
estimated  using  the  Kaplan-Meier  method.  Univariate  Cox  proportional 
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hazard  models  were  used  to  assess  the  effect  of  covariates  on  over¬ 
all  survival  and  recurrence-free  survival.  All  computations  were  done 
using  SAS  and  Splus  2000  (Insightful)  statistical  software.  The  mixed 
effect  model  was  used  to  assess  the  differences  in  scores  between 
normal  and  abnormal  epithelia.  The  generalized  estimating  equation 
approach  was  used  to  estimate  differences  in  the  means  for  the  data 
in  Table  2. 


Results 

Fusl  immunohistochemical  expression  in  lung  cancer  speci¬ 
mens.  Lung  cancer  histologies  varied  in  their  pattern  of 
immunohistochemical  expression  of  Fusl  in  the  cytoplasm  of 
the  tumor  cells.  We  detected  a  statistically  significant  difference 
(P  =  0.001)  in  the  Fusl  expression  mean  score  among  the  three 
major  types  of  lung  cancer  histologies  examined.  SCLCs  had  the 
lowest  mean  score  (57;  SD,  67.4),  with  tumors  having  either  no 
protein  expression  (41%)  or  reduced  expression  (59%;  Table  2; 
Fig.  1).  Squamous  cell  carcinomas  and  adenocarcinomas 
showed  intermediate  levels  of  Fusl  expression,  with  mean 
scores  of  127  (SD,  91.8)  and  111  (SD,  79.1),  respectively. 
Overall,  82%  (230  of  281)  of  NSCLCs  had  lower  Fusl 
expressions  (69%)  or  no  Fusl  expression  (13%).  We  note  that 
the  difference  between  squamous  cell  and  adenocarcinoma 
histologies  was  not  shown  to  be  statistically  significant  at  the 
0.05  level  (P  =  0.07;  Table  2).  There  was  a  significant  difference 
(P  =  0.0008)  in  the  levels  of  Fusl  expression  between  SCLCs 
and  NSCLCs  (Table  2).  Overall,  lung  tumor  specimens  showed 
lower  scores  and  levels  of  expression  compared  with  normal 
epithelium.  Most  tumors  (231  out  of  303,  76%;  91%  of  SCLCs 
and  75%  of  NSCLCs)  had  a  score  of  <200,  which  was  never 
associated  with  normal  tissue. 


Correlation  between  Fusl  immunohistochemical  expression  in 
NSCLC  and  clinicopathologic  features.  Using  Fusl  expression 
mean  scores  and  score  levels,  we  detected  no  statistically  sig¬ 
nificant  correlation  and/or  association  between  protein  expres¬ 
sion  and  the  clinicopathologic  data,  including  sex,  age,  ethnicity, 
smoking  history,  and  tumor-node-metastasis  pathologic  stage. 
Overall  and  disease-free  survival  analyses  for  Fusl  expression 
were  done  in  280  patients  with  tumor-node-metastasis  stages  I 
to  IV  (median  follow-up,  3.90  years),  and  in  218  patients  with 
stages  I  and  II  (median  follow-up,  4.03  years)  NSCLCs,  who  did 
not  receive  neoadjuvant  or  adjuvant  therapies.  Of  interest,  in 
both  univariate  and  multivariate  Cox  model  analyses,  the  hazard 
ratios  for  overall  survival  were  much  lower  in  cases  having  any 
level  of  Fusl  expression  (low,  intermediate,  and  high)  compared 
with  absence  (negative)  of  protein  expression  (Table  3).  These 
differences  were  statistically  significant  in  most  comparisons. 
Although  the  hazard  ratios  for  recurrence-free  survival  showed 
similar  trends  than  overall  survival,  the  P  values  were  not 
statistically  significant  (Supplementary  Table  SI). 

Fusl  immunohistochemical  expression  in  the  sequential  path¬ 
ogenesis  of  lung  cancer.  To  characterize  the  pattern  of  Fusl 
expression  in  the  sequential  pathogenesis  of  NSCLC,  we 
investigated  the  protein  immunohistochemical  expression  in 
histologically  normal  epithelium,  hyperplasia,  squamous 
metaplasia,  and  squamous  dysplasia  bronchial  sites  obtained 
from  surgically  resected  NSCLC  specimens  (Fig.  2).  We  also 
examined  56  AAH  lesions,  a  putative  precursor  lesion  for 
adenocarcinomas  (3,  4).  Mean  scores  and  score  levels  of 
Fusl  cytoplasmic  expression  in  the  epithelia  were  used  for 
comparison  between  ah  different  epithelial  histologic  catego¬ 
ries.  Overall,  normal,  mildly  abnormal,  and  preneoplastic 
respiratory  epithelia  showed  higher  mean  scores  and  score 


Table  2.  Fusl  immunohistochemical  expression  in 
specimens 

lung  cancer  and  epithelial  foci 

using  tissue  microarray 

Histology 

No.  of 

Fusl  score. 

Fusl  score  levels 

of  samples 

samples 

mean  (SD) 

Lost 

(negative) 
n  (%) 

Reduced  (low  + 
intermediate) 
n  (%) 

Preserved 
(high)  n  (%) 

P  value,  Fusl  levels 

Cancer  specimens 

SCLC 

22 

57  (67.4) 

9(41) 

13  (59) 

0 

Comparison  between  tumors 
0.0008 

NSCLC 

281 

121  (87.3) 

36(13) 

194  (69) 

51 (18) 

Adenocarcinoma 

172 

127  (91.8) 

25  (15) 

110  (64) 

37  (22) 

0.07 

Squamous  cell 

109 

111  (79.1) 

11  (10) 

84  (77) 

14(13) 

carcinoma 

Epithelial  specimens 

Normal  epithelium 

68 

251  (46.6) 

0 

27  (40) 

41  (60) 

Comparison  between  normal 
and  abnormal  epithelia* 

Score  Level 

Ref f  Ref f 

Basal  cell  hyperplasia 

120 

263  (47.5) 

0 

40  (33) 

80  (67) 

0.35  0.68 

Squamous  metaplasia 

23 

215  (46.0) 

0 

18  (78) 

5(22) 

0.014  0.014 

Squamous  dysplasia 

62 

224  (51.3) 

0 

25  (45) 

31  (55) 

0.0004  0.047 

Low-grade  dysplasia 

14 

200  (39) 

0 

12 (86) 

2(14) 

0.010  0.28 

High-grade  dysplasia 

48 

231  (53) 

0 

23  (48) 

25  (52) 

0.005  0.019 

AAH 

56 

250  (52.3) 

0 

25  (45) 

31  (55) 

0.61  0.79 

*Mixed  effect  model  was  used  to  assess  difference  of  scores  between  normal  and  abnormal  epithelia.  A  GEE  model  was  used  to  compare  the 

differences  in  FUS1  level  between  normal  and  abnormal  epithelia. 
f  Normal  epithelium  was  used  as  reference  value  (Ref). 
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Table  3.  Univariate  and  multivariate  Cox  model  assessing  effects  of  covariates  on  overall  survival 


(A)  Univariate  Cox  model 


Variable 

All  stages  (#1  = 

280) 

Stages  I  and  II  (n 

=  213) 

Hazard  ratio 

P 

Hazard  ratio 

P 

Age 

1.044 

<0.0001 

1.057 

<0.0001 

Histology 

Squamous  cell  vs.  adenocarcinoma 

1.585 

0.02 

1.667 

0.03 

Tumor* 

T2  +  3  +  4  vs.  T ! 

2.024 

0.002 

— 

— 

T2  +  3  vs.  Ti 

— 

— 

2.175 

0.002 

Lymph  node* 

Ni  +  2  vs.  N0 

2.122 

0.0003 

2.531 

0.0002 

Ni  vs.  N0 

— 

— 

— 

— 

Tumor  stage* 

II  +  III  +  IV  vs.  I 

2.01 

0.0005 

— 

— 

II  vs.  I 

— 

— 

2.47 

0.0002 

Fusl  score  mean 

0.999 

0.70 

0.999 

0.61 

Fusl  score  level 

Low  f  vs.  negative  * 

0.535 

0.06 

0.502 

0.07 

Intermediate  f  vs.  negative  * 

0.744 

0.35 

0.831 

0.60 

H igh§  vs.  negative  * 

0.685 

0.28 

0.536 

0.13 

(B)  Multivariate  Cox  model 

Variable 

All  stages 

Stages  I  and  II 

Hazard  ratio 

P 

Hazard  ratio 

P 

Age 

1.047 

<0.0001 

1.059 

<0.0001 

Histology 

Squamous  cell  vs.  adenocarcinoma 

1.528 

0.04 

1.438 

0.13 

Tumor  stage* 

II  +  III  +  IV  vs.  I 

2.163 

0.0002 

— 

— 

II  vs.  I 

— 

— 

2.519 

0.0004 

Fusl  score  level 

Low f  vs.  negative  * 

0.357 

0.002 

0.312 

0.003 

Intermediate  f  vs.  negative  * 

0.474 

0.02 

0.544 

0.10 

High§  vs.  negative  * 

0.518 

0.06 

0.415 

0.03 

*Pathologic  tumor,  lymph  node,  and  metastasis  stage. 

fLow  and  intermediate,  reduced  Fusl  immunohistochemical  expression. 

*Negative,  absence  of  Fusl  immunohistochemical  expression. 

§High,  preserved  or  normal  Fusl  immunohistochemical  expression. 


levels  of  Fusl  expression  compared  with  tumors,  and  none  of 
the  nonmalignant  epithelial  specimens  showed  loss  of  expres¬ 
sion.  Normal  and  hyperplastic  epithelia  displayed  high  mean 
scores  and  score  levels  of  Fusl  expression  (Table  2),  with  high 
expression  score  levels  in  60%  (41  of  68)  and  67%  (80  of  120) 
of  normal  and  hyperplastic  foci,  respectively.  Squamous 
metaplasia  and  dysplasia  lesions  had  significantly  lower  score 
means  and  score  levels  of  Fusl  expression  compared  with 
histologically  normal  and  hyperplastic  epithelia,  with  78%  and 
57%  of  the  squamous  metaplastic  and  dysplastic  lesions, 
respectively,  showing  a  lower  protein  expression  (Table  2). 
No  differences  were  detected  in  the  level  of  Fusl  expression 
comparing  low-grade  (mild  and  moderate  dysplasias)  and 
high-grade  (carcinoma  in  situ  and  severe  dysplasia)  squamous 
dysplastic  lesions.  No  significant  differences  in  the  mean  scores 
and  score  levels  of  Fusl  expression  were  detected  comparing 
normal  and  hyperplastic  bronchial  epithelium  with  AAH 
lesions.  No  significant  associations  were  observed  between 
Fusl  expression  and  age,  sex,  or  smoking  status  of  NSCLCs 
patients  from  whom  epithelial  specimens  were  obtained. 

Discussion 

Mutations  in  3p21.3  genes  are  rarely  found  in  human 
tumors,  including  lung  cancer  (7).  Therefore,  some  mecha¬ 
nisms  other  than  the  classic  two-hit  model,  which  requires 
mutation  in  one  allele  and  silencing  or  loss  on  another  allele, 
might  be  of  importance  in  the  ultimate  inactivation  of  3p21.3 


genes.  These  alternative  mechanisms  include  promoter  meth- 
ylation,  haploinsufficiency,  altered  RNA  splicing,  as  well  as 
defects  in  transcriptional,  translational,  and  posttranslational 
processes.  In  lung  cancer,  only  a  few  FUS1  mutations  that  alter 
or  truncate  amino  acid  sequences  have  been  detected  (7),  and 
its  promoter  methylation  is  a  rare  phenomenon  (11).  It  has 
been  hypothesized  that  this  gene  is  inactivated  in  lung  tumors 
by  alternative  mechanisms,  such  as  influence  from  the  stochastic 
effects  of  3p21.3  allele  haploinsufficiency  (6)  and  a  posttrans¬ 
lational  modification  of  the  gene  product  by  deficient  N- 
myristoylation  of  the  Fusl  protein  (11).  It  has  been  shown  that 
myristoylation  is  required  for  FUS1  -mediated  tumor-suppress¬ 
ing  activity,  suggesting  a  novel  mechanism  for  the  inactivation 
of  tumor-suppressor  genes  for  human  cancers  (11).  Uno  et  al. 
(11),  using  surface-enhanced  laser  desorption/ionization  time- 
of-flight  mass  spectrometry,  analyzed  the  N -myristoylation 
status  of  Fusl  protein  in  frozen  lung  cancer  tissue  specimens 
and  determined  that  protein  loss  of  immunohistochemical 
expression  correlated  with  loss  of  protein  myristoylation. 

Thus,  to  investigate  the  frequency  and  pattern  of  Fusl  protein 
expression  abnormalities  in  lung  cancer,  we  examined  the 
immunohistochemical  expression  of  the  protein  using  tissue 
microarrays  in  a  large  series  of  primary  tumor  specimens  with 
annotated  clinical  information.  Our  findings  confirmed  and 
further  expanded  previously  reported  data  on  loss  of  Fusl 
protein  expression  in  20  NSCLC  tissue  specimens  (11).  Lung 
tumors  showed  lower  levels  of  expression  than  normal 
bronchial  epithelium.  We  found  that  loss  or  reduction  of 
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Fig.  2.  Representative  examples  of  Fusl  immunohistochemical  expression  in  histologically  normal  {A),  hyperplastic  ( B ),  metaplastic  (C),  and  dysplastic  (D-F)  bronchial 
epithelia.  High  levels  of  expression  are  shown  in  the  normal  and  hyperplastic  epithelia  and  reduced  levels  in  squamous  metaplasia  and  dysplasias.  Original  magnification, 
x400. 


Fusl  immunohistochemical  expression  was  present  in  all 
SCLCs  and  most  (82%)  of  the  NSCLCs  (87%  of  the  squamous 
cell  carcinomas  and  79%  of  the  adenocarcinomas).  Fusl 
protein  expression  was  absent  in  half  of  the  SCLCs  and  in 
13%  of  the  NSCLCs.  For  NSCLCs,  we  found  that  loss  of  Fusl 
expression  is  a  significant  independent  adverse  prognostic 
factor  for  patients'  overall  survival.  Our  findings  that  retention 
of  any  level  of  Fusl  immunohistochemical  expression  signif¬ 
icantly  correlates  with  better  outcome  in  NSCLC  are  in 
agreement  with  the  potent  effect  of  tumor  suppressor  activity 
of  FUS1  gene  as  shown  by  in  vitro  and  in  vivo  studies  (11,  12). 
Although  FUS1  is  one  of  the  nine  putative  tumor-suppressor 
genes  located  in  the  3p21.3  region  frequently  deleted  in  lung 


and  other  tumor  sites  (7),  our  study  is  the  first  report  on  Fusl 
protein  expression  abnormalities  in  a  large  series  of  human 
tumors  arising  at  any  site. 

The  ability  to  rescue  the  tumor  phenotype  by  gene 
inactivation  after  the  replacement  of  those  genes  is  one  of  the 
criteria  needed  for  a  gene  to  be  considered  a  tumor-suppressor 
gene.  There  is  evidence  indicating  that  the  replacement  of 
FUS1  in  nonexpressive  NSCLC  cell  lines  inhibits  cell  growth 
and  induces  apoptosis  (9,  10).  More  importantly,  Ji  et  al.  (9) 
showed  that  intratumoral  injection  of  adenovirus -FUS1  sig¬ 
nificantly  reduces  tumor  growth  in  FUS1  region  -  deficient 
tumor  xenografts  and  experimental  metastasis.  Following 
in  vitro  results,  Ito  et  al.  (12)  reported  that  intratumoral  and 
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intravenous  administration  of  FUS1  complexed  to  nanopar¬ 
ticles  in  mice  bearing  human  lung  cancer  cell  line  xenografts 
resulted  in  the  inhibition  of  tumor  growth,  a  decreased  number 
of  metastases,  and  prolonged  survival  compared  with  results  for 
untreated  animals.  The  restoration  of  tumor- suppressor  genes 
altered  in  cancer  development  is  currently  a  valid  therapeutic 
approach.  On  the  basis  of  preclinical  studies,  a  phase  I  clinical 
trial  by  our  group  using  FUS1  -mediated  molecular  therapy  by 
systemic  administration  of  FUS1  -nanoparticles  is  currently 
under  way  in  stage  IV  NSCLC  patients  at  The  University  of 
Texas  M.D.  Anderson  Cancer  Center.  Our  findings  of  a  lack  or 
reduction  of  expression  of  Fusl  protein  in  most  lung  tumors 
supports  the  concept  of  the  delivery  of  functional  FUS1  as  an 
effective  therapeutic  strategy  for  human  lung  cancer. 

Lung  cancers  are  believed  to  be  the  consequence  of  a  series  of 
progressive  preneoplastic  changes  in  the  respiratory  mucosa  that 
accumulate  a  sequence  of  genetic  changes  (14).  These  genetic 
abnormalities  are  frequently  extensive  and  multifocal  through¬ 
out  the  respiratory  epithelium,  indicating  a  field  effect  or  field 
cancerization  phenomenon  (4).  Although  the  sequential 
preneoplastic  changes  have  been  defined  for  squamous  carci¬ 
nomas,  they  have  been  poorly  documented  for  lung  adenocar¬ 
cinomas  and  SCLCs  (4).  Mucosal  changes  in  the  large  airways 
that  may  precede  squamous  cell  carcinoma  include  squamous 
dysplasia  and  carcinoma  in  situ  in  the  central  airway  (15,  16). 
Adenocarcinomas  may  be  preceded  by  morphologic  changes, 
including  AAH  in  peripheral  airway  cells  (17).  In  squamous  cell 
carcinoma  pathogenesis,  genetic  abnormalities  commence  in 
histologically  normal  epithelium  and  augment  with  increasing 
severity  of  histologic  changes,  with  3p21.3  allelic  loss  as  the 
earliest  genetic  change  being  detected  in  patients  with  lung 
cancer  and  smokers  (6,  8).  Our  findings  of  a  significant 
reduction  of  Fusl  protein  immunohistochemical  expression  in 
squamous  metaplasia  and  dysplasia  histologies  compared  with 
histologically  normal  and  hyperplastic  epithelia  suggest  that  the 
reduction  and  partial  inactivation  of  3p21.3  FUS1  gene 
expression  is  an  early  phenomenon  in  the  pathogenesis  of 
squamous  cell  carcinoma.  AAH  lesions  showed  some  reduction 
in  the  level  of  Fusl  immunohistochemical  expression  in  a  subset 
of  cases  (45%),  but  no  differences  compared  with  histologically 


normal  and  hyperplastic  epithelia  were  detected.  Although  3pl2 
and  3pl4  allelic  losses  have  been  shown  in  ~  20%  of  AAHs,  no 
detailed  mapping  analysis  of  chromosome  3p  that  includes  the 
FUS1  3p21.3  region  has  been  done  in  these  lesions  (18). 

Although  a  total  loss  of  Fusl  protein  immunohistochemical 
expression  was  detected  in  50%  of  SCLCs  and  in  13%  of 
NSCLSs,  no  normal  and  abnormal  epithelial  sites  showed  a 
complete  lack  of  Fusl  protein  expression.  Because  most  lung 
cancers  and  adjacent  preneoplastic  lesions  have  shown  allelic 
loss  at  the  3p21.3  region  (6),  FUS1  haploinsufficiency  may  play 
a  role  in  the  inactivation  of  Fusl  protein  in  lung  cancer 
pathogenesis  (19).  In  diploid  cells,  each  gene  exists  in  two 
copies,  in  contrast  to  haploids  in  which  each  cell  contains  a 
single  copy  of  the  genome.  When  one  of  the  alleles  is  mutated 
or  deleted,  there  is  an  ~  50%  reduction  in  the  level  of  proteins 
synthesized.  Generally,  the  haploinsufficiency  occurs  when  the 
level  of  proteins  synthesized  decreases  below  a  threshold  level 
and  is  insufficient  for  the  onset  of  some  desired  biological 
activity.  We  have  detected  lower  Fusl  protein  expression  in  a 
subset  of  all  epithelial  foci  examined,  including  histologically 
normal,  hyperplastic,  metaplastic,  and  preneoplastic  epithelia. 
Interestingly,  the  reduction  of  Fusl  expression  in  epithelial 
specimens,  including  40%  of  histologically  normal  epithelia, 
was  observed  in  samples  from  both  smoker  and  never-smoker 
NSCLC  patients,  suggesting  that  this  phenomenon  is  not 
necessarily  associated  with  smoking.  We  speculate  that  allelic 
loss  and  the  haplotype  in  the  FUS1  3p21.3  region  at  very  early 
stages  of  lung  tumor  pathogenesis  may  lead  to  a  reduction  of 
Fusl  protein  synthesis.  We  hypothesize  that  a  deficiency  in 
myristoylation  might  lead  to  a  greater  reduction  and  complete 
loss  of  Fusl  protein  expression  in  the  later  stages  of  tumor 
development,  such  as  in  microinvasive  and  invasive  lesions. 

In  summary,  our  findings  show  a  high  frequency  of  Fusl 
protein  reduction  and  loss  of  expression  in  SCLC  and  NSCLC 
tissue  specimens,  and  suggest  that  reduction  of  this  protein  may 
play  an  important  role  in  the  early  pathogenesis  of  squamous 
cell  carcinomas.  All  these  findings  support  the  concept  that 
FUS1  gene  and  protein  abnormalities  could  be  used  to  develop 
new  strategies  for  molecular  cancer  therapy  for  a  significant 
subset  of  lung  tumors. 
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Overall  survival  Probability 


Supplementary  Figure  SI.  Overall  Survival  curve 


Overall  Survival  by  FUs-1  Grade  in  Stages  I  &  II  patients  without  Neo-Adjuvant  TX 


Supplementary  Table  SI.  Univariate  Cox  model  assessing  effects  of  covariates  on  recurrence  free  survival 


Variable 

All  stages  (N=280) 

Stages  I  and  II  (N=213) 

Hazard  ratio 

p- Value 

Hazard  ratio 

p- Value 

Age 

1.012 

0.30 

1.028 

0.08 

Histology 

Squamous  cell  vs.  adenocarcinoma 

0.876 

0.61 

1.185 

0.5892 

Tumor1 

T2+3+4  vs.  T1 

1.404 

0.19 

— 

— 

T2+3  vs.  T1 

— 

— 

1.352 

0.3412 

Lymph  node1 

Nl+2  vs.  NO 

2.629 

0.0001 

— 

— 

N1  vs.  NO 

— 

— 

1.852 

0.092 

Tumor  stage1 

II+III+IV  vs.  I 

2.564 

0.0001 

— 

— 

II  vs.  I 

— 

— 

1.916 

0.0596 

Fusl  score  mean 

1.000 

0.75 

0.999 

0.6923 

Fusl  score  level 

Low2  vs.  negative3 

0.645 

0.29 

0.551 

0.23 

Intermediate2  vs.  negative3 

0.790 

0.57 

0.708 

0.48 

High4  vs.  negative3 

0.939 

0.89 

0.614 

0.37 

1  Pathology  TNM  stage  =  tumor,  lymph  node  and  metastasis. 

2  Low  and  intermediate  =  reduced  Fusl  IHC  expression. 

3  Negative  =  absence  of  Fusl  IHC  expression. 

4  High  =  preserved  or  normal  Fusl  IHC  expression. 


Supplementary  Table  S2.  Demographic  and  clinico-pathologic  data  regarding  the  lung  cancer  and  respiratory  epithelial  samples 
studied  for  Fusl  immunohistochemical  expression 


Type  and  histology  of  samples 

No. 

(%) 

Sex 

F 

(%) 

M 

I 

Stage1  (%) 

II  III 

IV 

Smoking 
history2  (%) 
Yes  No 

Smoking  status  (%) 

Never  Former  Current 

303 

155 

148 

— 

— 

— 

— 

— 

— 

— 

— 

— 

Total  cancers 

(51.2)  (48.8) 

SCLC 

22 

7 

15 

— 

— 

— 

— 

22 

0 

0 

3 

7 

(7.3) 

(31.9)  (68.2) 

(100) 

(0) 

(0) 

(30) 

(70) 

NSCLC 

281 

148 

133 

178 

57 

38 

8 

194 

80 

80 

117 

73 

(92.7) 

(52.7) 

(47.3)  (63.4)  (20.3) 

(13.5)  (2.8) 

(70.8) 

(29.2) 

(29.6) 

(43.3) 

(27.1) 

Adenocarcinoma 

172 

105 

67 

118 

22 

27 

5 

108 

61 

61 

63 

41 

(61.2) 

(61) 

(39) 

(68.6)  (12.8) 

(15.7) 

(2.9) 

(63.9) 

(36.1) 

(36.9) 

(38.2) 

(25.7) 

Squamous  cell  carcinoma 

109 

43 

66 

60 

35 

11 

3 

86 

19 

19 

54 

32 

(38.8) 

(39.5) 

(60.5) 

(55) 

(32.1) 

(10.1) 

(2.8) 

(81.9) 

(18.1) 

(18.1) 

(51.4) 

(30.5) 

Total  epithelial  foci 

329 

136 

193 

_ 

_ 

_ 

_ 

274 

50 

50 

120 

145 

(41.3)  (58.7) 

(84.6) 

(15.4) 

(15.9) 

(38.1) 

(46) 

Normal  epithelium 

68 

35 

33 

— 

— 

— 

— 

53 

15 

15 

26 

27 

(20.6) 

(51.5)  (48.5) 

(77.9) 

(22.1) 

(22.1) 

(38.2) 

(38.7) 

Hyperplasia 

120 

31 

89 

— 

— 

— 

— 

107 

11 

11 

50 

57 

(36.5) 

(25.8)  (74.2) 

(90.7) 

(9.3) 

(9.3) 

(42.4) 

(48.3) 

Squamous  metaplasia 

23 

10 

13 

— 

— 

— 

— 

18 

2 

2 

9 

9 

(V) 

(43.5)  (56.5) 

(90) 

(10) 

(10) 

(45) 

(45) 

Low  grade  dysplasia 

14 

6 

8 

— 

— 

— 

— 

14 

0 

0 

8 

6 

(4.3) 

(42.9)  (57.1) 

(100) 

(0) 

(0) 

(57.1) 

(42.9) 

High  grade  dysplasia 

48 

12 

36 

— 

— 

— 

— 

44 

4 

4 

11 

24 

(14.6) 

(25) 

(75) 

(91.7) 

(8.3) 

(10.3) 

(28.2) 

(61.5) 

AAH4 

56 

42 

14 

— 

— 

— 

— 

38 

18 

18 

16 

22 

(17) 

(75) 

(25) 

(67.8) 

(32.2) 

(32.2) 

(28.6) 

(39.2) 

1  Staging  is  shown  only  for  NSCLC  cases.  All  SCLCs  were  limited  stage. 

2  Patient  smoking  history  was  not  available  for  seven  NSCLC  and  five  epithelial  specimens. 

3  Patient  smoking  status  was  not  available  for  12  SCLC,  1 1  NSCLC,  and  14  epithelial  specimens. 

4  AAH  =  atypical  adenomatous  hyperplasia. 
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Abstract 


Epithelial-to-mesenchymal  transition  (EMT)  is  a  process  in  which  cells  undergo  a  developmental 
switch  from  an  epithelial  to  a  mesenchymal  phenotype.  We  investigated  the  role  of  EMT  in  the 
pathogenesis  and  progression  of  non-small  cell  lung  carcinoma  (NSCLC).  Archived  tissue  from 
adenocarcinoma  and  squamous  cell  carcinoma  primary  tumors  (n=325)  and  brain  metastases 
(n=48)  and  adjacent  bronchial  epithelial  specimens  (n=192)  were  analyzed  for 
immunohistochemical  expression  by  image  analysis  of  E-cadherin,  N-cadherin,  integrin-av(36, 
vimentin,  and  matrix  metalloproteinase-9.  The  findings  were  compared  with  patients’ 
clinicopathologic  features.  High  expression  of  the  EMT  phenotype  (low  E-cadherin  and  high  N- 
cadherin,  integrin-av(36,  vimentin,  and  matrix  metalloproteinase-9)  was  found  in  most  NSCLC 
tumors.  In  primary  tumors,  the  expression  pattern  varied  according  to  the  tumor  histologic  type. 
Low  E-cadherin  membrane  and  high  N-cadherin  cytoplasmic  expression  were  significantly  more 
common  in  squamous  cell  carcinoma  than  in  adenocarcinoma  (P=0.002  and  0.005,  respectively). 
Dysplastic  lesions  had  significantly  lower  expression  of  the  EMT  phenotype  than  did  squamous 
cell  carcinomas,  and  integrin-av(36  membrane  expression  increased  stepwise  according  to  the 
histopathologic  severity.  Brain  metastases  had  decreased  EMT  expression  compared  with 
primary  tumors.  Brain  metastases  had  significantly  lower  integrin-av(36  membrane  (.P=0.04)  and 
N-cadherin  membrane  and  cytoplasm  (P<0.0002)  expression  than  did  primary  tumors. 

The  EMT  phenotype  is  commonly  expressed  in  primary  squamous  cell  carcinoma  and 
adenocarcinoma  of  the  lung;  this  expression  occurs  early  in  the  pathogenesis  of  squamous  cell 
carcinoma.  NSCLC  brain  metastases  showed  characteristics  of  reversed  mesenchymal-epithelial- 
transition.  Our  findings  suggest  that  EMT  is  a  potential  target  for  lung  cancer  chemoprevention 
and  therapy. 
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Introduction 


Lung  cancer  remains  the  leading  cause  of  cancer-related  death  in  the  United  States  (1).  It 
is  a  highly  complex  neoplasm  (2)  that  includes  several  histologic  types,  the  most  common  being 
small  cell  lung  carcinoma  (SCLC)  and  two  types  of  non-SCLC  (NSCLC),  adenocarcinoma  and 
squamous  cell  carcinoma  (3).  Despite  therapeutic  advances,  the  5-year  survival  rate  across  all 
stages  of  the  disease  is  approximately  15%  1,  as  many  patients  are  not  diagnosed  until  the 
disease  is  advanced.  Surgery  is  considered  the  best  treatment  option,  but  only  approximately 
25%  of  NSCLC  tumors  are  suitable  for  potentially  curative  resection  (4).  NSCLC  is  believed  to 
result  from  a  progressive  series  of  lesions  (5).  This  sequence  has  been  clearly  established  in 
squamous  cell  carcinoma,  but  in  adenocarcinoma,  the  progression  sequence  is  less  clear  (5).  A 
deeper  understanding  of  the  molecular  mechanisms  involved  in  the  pathogenesis  and  progression 
of  lung  cancer  may  lead  to  new  and  more  effective  strategies  for  early  detection  and  targeted 
chemoprevention  and  treatment. 

Epithelial-to-mesenchymal  transition  (EMT)  is  a  process  in  which  cells  undergo  a 
developmental  switch  from  an  epithelial  to  a  motile  mesenchymal  phenotype  (6).  This  process 
has  been  related  to  embryologic  morphogenesis,  fibrosis,  and  lately,  to  the  progression  and 
metastasis  of  epithelial  tumors  (7).  Solid  tumor  progression  requires  EMT  for  tumor  cells  to 
invade  and  metastasize.  Colon,  prostate,  and  breast  cancer  are  tumors  with  the  EMT  phenotype 
(8).  In  lung  cancer,  EMT  has  been  studied  in  vitro;  the  expression  of  individual  markers  in  EMT 
has  been  described,  and  these  markers  are  associated  with  prognosis  (9,  10).  These  findings  have 
led  to  the  hypothesis  that  EMT  is  a  target  for  lung  cancer  therapy.  In  addition,  data  suggest  that 
EMT  is  influenced  by  epidermal  growth  factor  receptor  (EGFR)  activity  and  predicts  EGFR 
tyrosine  kinase  inhibitor  (TKI)  sensitivity  (9,  1 1). 

To  better  understand  the  importance  of  EMT  in  NSCLC  pathogenesis  and  progression, 
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we  determined  the  immunohistochemical  expression  of  five  markers  related  to  EMT  (E-cadherin, 
N-cadherin,  integrin-av(36,  vimentin,  and  matrix  metalloproteinase  (MMP)-9)  by  image  analysis 
of  tissue  microarray  (TMA)  specimens.  We  compared  these  findings  with  patients’ 
clinicopathologic  features  and  the  immunohistochemical  expression  of  EGFR  and 
phosphorylated  EGFR  (pEGFR).  In  a  subset  of  adenocarcinomas,  we  compared  EMT  expression 
with  the  presence  of  EGFR  mutations.  In  addition,  we  investigated  the  role  of  EMT  in  the  early 
pathogenesis  of  lung  cancer  by  studying  marker  expression  in  the  preneoplastic  sequence  of 
squamous  cell  carcinoma.  Mesenchymal-to-epithelial  transition  (MET),  the  reverse  of  EMT,  has 
been  hypothesized  to  occur  at  the  site  of  tumor  metastasis  (6,  8).  Therefore,  to  determine  whether 
MET  was  present  in  advanced  lung  tumors,  we  analyzed  EMT  markers  in  lung  cancer  brain 
metastases. 

Materials  and  methods 

Case  selection  and  TMA  construction.  We  obtained  archived,  formalin-fixed,  paraffin-embedded 
(FFPE)  tissue  from  surgically  resected  lung  cancer  specimens  (lobectomies  and 
pneumonectomies)  containing  tumor  and  adjacent  normal  and  abnormal  epithelium  tissues  from 
the  Lung  Cancer  Specialized  Program  of  Research  Excellence  Tissue  Bank  at  The  University  of 
Texas  M.  D.  Anderson  Cancer  Center  (Houston,  TX),  which  has  been  approved  by  the 
institutional  review  board.  The  tissue  had  been  collected  from  1997  to  2001.  The  tissue 
specimens  were  histologically  examined  and  classified  using  the  2004  World  Health 
Organization  classification  system  (3).  We  selected  325  NSCLC  tissue  samples  (209 
adenocarcinomas  and  116  squamous  cell  carcinomas)  for  our  TMA.  TMAs  were  constructed 
using  triplicate  1-mm  diameter  cores  per  tumor;  each  core  included  central,  intermediate,  and 
peripheral  tumor  tissue.  Detailed  clinical  and  pathologic  information,  including  demographics, 
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smoking  history  (never-  and  ever-smokers),  and  smoking  status  (never,  former,  and  current), 
clinical  and  pathologic  TNM  stage,  overall  survival  duration,  and  time  to  recurrence,  was 
available  for  most  cases  (Table  1).  Tumors  were  pathologic  TNM  stages  I-IV  according  to  the 
revised  International  System  for  Staging  Lung  Cancer  (12).  The  expression  of  integrin-av(36  in 
this  NSCLC  TMA  has  been  previously  described  (13). 

To  assess  the  immunohistochemical  expression  of  EMT  markers  in  the  early 
pathogenesis  of  NSCLC,  we  studied  TMAs  containing  192  bronchial  epithelium  specimens 
(normal  histology,  N=49;  basal  cell  hyperplasia,  N=76;  squamous  metaplasia,  N=13;  squamous 
dysplasia,  N=34;  and  carcinoma  in  situ,  N=20)  from  89  patients  with  lung  cancer.  For  statistical 
analysis,  we  grouped  the  lesions  by  tissue  type:  normal  and  reactive  (including  normal, 
hyperplasia,  and  squamous  metaplasia)  and  dysplastic  (including  dysplasia  and  carcinoma  in 
situ).  Finally,  to  determine  the  expression  of  EMT  markers  in  lung  cancer  metastasis,  we 
examined  TMAs  containing  48  brain  metastases  (37  from  adenocarcinomas  and  11  from 
squamous  cell  carcinomas). 

Immunohistochemical  staining  and  evaluation.  Antibodies  against  the  following  molecules  were 
purchased  and  used:  E-cadherin  (Santa  Cruz  Biotechnology,  Santa  Cruz,  CA;  catalogue  number 
Sc-8426;  dilution  1:100),  N-cadherin  (Zymed,  Carlsbad,  CA;  catalogue  number  18-0224, 
concentration  1:100),  integrin-av(36  (Calbiochem,  Gibbstown,  NJ;  catalogue  number  407317; 
dilution  1:300),  vimentin  (Dako,  Carpinteria,  CA;  catalogue  number  M-0725;  dilution  1:400), 
MMP-9  (Genetex,  San  Antonio,  TX;  catalogue  number  GTX-58899;  dilution  1:400),  EGFR,  and 
pEGFR  (Tyr-1086;  Invitrogen,  Carlsbad,  CA;  catalogue  numbers  28-0005  and  36-9700, 
respectively;  dilution  1:100).  Immunohistochemical  staining  was  performed  as  follows:  5-pM- 
thick  FFPE  tissue  sections  were  deparaffmized,  hydrated,  heated  in  a  steamer  for  10  minutes 
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with  10  mM  sodium  citrate  (pH  6.0)  for  antigen  retrieval,  and  washed  in  Tris  buffer.  Peroxide 
blocking  was  performed  with  3%  H202  in  methanol  at  ambient  temperature  for  15  minutes, 
followed  by  10%  bovine  serum  albumin  in  tris-buffered  saline-t  at  ambient  temperature  for  30 
minutes.  The  slides  were  incubated  with  primary  antibody  at  ambient  temperature  and  washed 
with  phosphate-buffered  saline,  followed  by  incubation  with  biotin-labeled  secondary  antibody 
for  30  minutes.  Finally,  the  samples  were  incubated  with  a  1:40  solution  of  streptavidin- 
peroxidase  for  30  minutes.  Staining  was  developed  with  0.05%  3',  3-diaminobenzidine 
tetrahydrochloride,  which  had  been  freshly  prepared  in  0.05  mol/L  Tris  buffer  at  pH  7.6 
containing  0.024%  H202  and  then  counterstained  with  hematoxylin,  dehydrated,  and  mounted. 
FFPE  NSCLC  cell  lines  and  tissues  with  normal  bronchial  epithelia  were  used  as  the  positive 
control.  As  the  negative  control,  we  used  the  same  specimens  used  as  positive  controls  but 
replaced  the  primary  antibody  with  phosphate-buffered  saline. 

Protein  expression  was  quantified  using  the  ARIOL  image  analysis  system  (Applied 
Imaging,  San  Jose,  CA).  This  system  gives  an  immunohistochemistry  reading  that  avoids 
interobserver  and  intraobserver  variability.  For  each  marker,  different  cell  sites  were  evaluated: 
membrane  in  E-cadherin  and  integrin-avp6,  membrane  and  cytoplasm  in  N-cadherin,  and 
cytoplasm  in  vimentin  and  MMP-9.  In  addition,  EGFR  and  pEGFR  (Tyr-1086)  membranous 
staining  was  evaluated.  Membrane  expression  was  scored  on  the  basis  of  the  intensity  and 
completeness  of  immunostaining,  similar  to  Her2-Neu  assessment  classification  (14).  For  the 
statistical  analysis,  we  used  class  score,  grouped  into  two  categories:  0  and  1,  low  expression; 
and,  2  and  3,  high  expression.  Cytoplasm  staining  was  scored  to  account  for  both  the  intensity 
and  extent  of  protein  expression.  These  scores  were  used  in  the  statistical  analysis  of  cytoplasm 
expression.  Vimentin  is  a  special  marker  that  is  highly  expressed  in  stromal  and  inflammatory 
cells,  but  it  is  not  suitable  for  image  analysis.  Therefore,  this  marker  was  scored  by  a  trained 
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pathologist  (L.P.)  using  light  microscopy,  a  four-value  intensity  scale  (0-3+),  and  percentage 
extent  (0%-100%).  The  vimentin  score  was  calculated  by  multiplying  both  parameters  (range,  0- 
300). 

EGFR  mutation  analysis.  Exons  18  through  21  of  EGFR  were  polymerase  chain  reaction  (PCR)- 
amplified  using  intron-based  primers  as  previously  described  (15,  16).  Approximately  200 
microdissected  FFPE  cells  were  used  for  each  PCR  amplification.  All  PCR  products  were 
directly  sequenced  using  the  Applied  Biosystems  PRISM  dye  terminator  cycle  sequencing 
method.  All  sequence  variants  were  confirmed  by  independent  PCR  amplifications  from  at  least 
two  independent  microdissections  and  DNA  extraction  and  sequenced  in  both  directions,  as 
previously  reported  (15,  16). 

Statistical  analysis.  The  data  were  summarized  using  standard  descriptive  statistics  and 
frequency  tabulation.  Associations  between  categorical  variables  were  assessed  via  cross¬ 
tabulation,  the  chi-square  test,  and  Fisher’s  exact  test.  The  Kruskal- Wallis  and  Wilcoxon  rank- 
sum  tests  were  performed  to  determine  the  differences  in  continuous  variables  by 
clinicopathologic  feature.  Survival  curves  were  estimated  using  the  Kaplan-Meier  method. 
Univariate  and  multivariate  Cox  proportional  hazard  models  were  used  to  assess  the  effect  of 
covariates  on  overall  survival  and  recurrence-free  survival.  All  computations  were  carried  out 
using  SAS  (Cary,  NC)  and  S-plus  2000  (Cambridge,  MA)  software.  P-values  smaller  than  0.05 
were  considered  as  statistical  significant. 
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RESULTS 


Immunohistochemical  expression  of  EMT  markers  in  lung  cancer  specimens.  Overall, 
adenocarcinoma  and  squamous  cell  carcinoma  had  high  levels  of  the  EMT  phenotype  on  the 
basis  of  protein  expression  of  the  five  markers  examined  (Figure  1  and  Table  2).  Both  histologic 
types  had  significantly  reduced  levels  of  E-cadherin  membrane  expression,  as  measured  by 
membrane  class  scores,  compared  with  corresponding  adjacent  normal  and  reactive  respiratory 
epithelium  (Figure  2).  In  contrast,  lung  tumors  had  significantly  higher  cytoplasm  scores  of  N- 
cadherin  and  MMP-9  and  membrane  class  scores  of  N-cadherin  and  integrin-av(36  compared 
with  adjacent  respiratory  epithelium  (Figure  2).  Vimentin  cytoplasm  scores  were  significantly 
higher  in  adenocarcinoma  tumors  than  in  normal  and  reactive  epithelium  but  not  in  squamous 
cell  carcinomas  (Figure  2). 

NSCLC  histologic  types  had  varied  patterns  of  immunohistochemical  EMT  marker 
expression  (Table  2).  E-cadherin  membrane  expression  was  low  (classes  0  and  1)  in  63%  of 
tumors,  and  this  low  level  of  expression  was  significantly  more  common  in  squamous  cell 
carcinomas  (73%)  than  in  adenocarcinomas  (57%)  (f^O.005).  High  levels  (classes  2  and  3)  of 
membrane  expression  of  N-cadherin  and  integrin-av(36  were  detected  in  68%  and  56%  of 
NSCLCs,  with  no  differences  by  histologic  type.  However,  the  N-cadherin  cytoplasm  score  was 
significantly  (P=0.002)  higher  in  squamous  cell  carcinoma.  No  significant  differences  in  the 
level  of  cytoplasm  expression  of  vimentin  and  MMP-9  were  detected  between  adenocarcinoma 
and  squamous  cell  carcinoma. 

We  compared  membrane  class  frequencies  and  scores  and  cytoplasm  scores  with 
patients’  demographic  and  clinicopatho logic  characteristics,  including  age,  sex,  smoking  history, 
and  TNM  pathologic  stage.  Interestingly,  N-cadherin  membrane  class  and  cytoplasm  scores  were 
significantly  higher  (P=0.004  and  P=0.0004,  respectively)  in  ever-smokers  than  in  never- 
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smokers.  No  significant  association  was  found  between  any  marker  expression  and  pathologic 
disease  stage.  We  also  determined  the  prognostic  (recurrence-free  and  overall  survival)  effect  of 
EMT  marker  expression;  the  only  marker  that  was  associated  with  outcome  was  integrin-avp6 
membrane  expression.  As  stated  above,  we  previously  found  (13)  that  membrane  integrin-av(36 
is  frequently  overexpressed  in  NSCLC  tumor  specimens  using  the  same  TMA  set  reported  here 
and  that  this  overexpression  significantly  affected  overall  survival  (data  not  shown). 

To  determine  whether  there  was  an  association  between  EMT  phenotype  and  EGFR 
protein  expression  in  NSCLC,  we  compared  the  immunohistochemical  expression  of  our  five 
EMT  markers  with  EGFR  and  pEGFR  (Tyr-1086)  expression  in  225  tumors.  No  association  was 
detected  between  the  immunohistochemical  expression  of  EMT  and  the  EGFR  proteins 
evaluated.  Likewise,  tumors’  EGFR  mutation  status  was  not  associated  with  EMT  marker 
expression  in  a  subset  of  135  adenocarcinomas,  including  32  (24%)  mutant  cases. 

EMT  marker  immunohistochemical  expression  in  the  sequential  pathogenesis  of  squamous  cell 
carcinoma.  We  determined  the  expression  of  five  EMT  markers  in  192  epithelial  specimens 
containing  histologically  normal,  hyperplastic,  squamous  metaplastic,  or  squamous  dysplastic 
bronchial  epithelia  adjacent  to  squamous  cell  carcinomas  obtained  from  89  patients.  We  found  an 
increased  level  of  the  EMT  phenotype,  corresponding  to  increased  histopathologic  severity,  in 
squamous  preneoplasias  (dysplasias  and  carcinoma  in  situ )  compared  with  corresponding 
primary  tumors.  The  membrane  expression  of  E-cadherin  was  significantly  reduced  in  tumors 
compared  with  preneoplastic  lesions  (Figure  3),  but  the  membrane  expression  of  N-cadherin  and 
integrin-av(36  (Figure  3)  and  cytoplasm  expression  of  N-cadherin  and  vimentin  (data  not  shown) 
were  significantly  increased.  Although  squamous  preneoplasias  had  significantly  higher 
expression  than  did  normal  and  reactive  epithelia  of  membrane  integrin-av(36  (Figure  3)  and 
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cytoplasm  MMP-9  (data  not  shown),  we  found  that  preneoplasias  had  significantly  lower 
expression  of  N-cadherin  and  vimentin  than  did  normal  and  reactive  epithelia  (data  not  shown). 
Of  interest,  integrin-ocvp6  membrane  expression  showed  a  clear  stepwise  increase,  according  to 
the  severity  of  histopathologic  lesions.  A  high  level  of  expression  (classes  2  and  3)  was  detected 
in  14  of  73  (19%)  normal  and  reactive  epithelia,  18  of  44  (41%)  squamous  preneoplasias,  and  59 
of  106  (56%)  primary  squamous  cell  carcinomas. 

Immunohistochemical  expression  of  EMT  markers  in  brain  metastases  compared  with  primary 
tumors.  We  studied  the  expression  of  EMT  markers  in  46  NSCLC  brain  metastases,  including  35 
adenocarcinomas  and  1 1  squamous  cell  carcinomas.  Interestingly,  we  found  that  metastasis  sites 
had  a  lower  level  of  the  EMT  phenotype  than  did  primary  tumors,  which  has  been  suggested  to 
correspond  to  MET  (Table  3)  (6).  Squamous  cell  carcinoma  brain  metastases  had  a  significantly 
higher  E-cadherin  membrane  class  score  than  did  primary  tumors.  The  metastasis  sites  of  both 
NSCLC  histologic  types  had  significantly  lower  integrin-av(36  membrane  and  N-cadherin 
membrane  and  cytoplasm  expression  scores  than  did  primary  tumors  (Table  3). 

Discussion 

To  our  knowledge,  this  is  the  first  study  to  fully  characterize  the  expression  of  multiple 
EMT  markers  in  NSCLC  primary  tumors  and  metastases  and  the  pathogenesis  of  squamous  cell 
carcinoma.  The  use  of  TMA  to  assess  tumor  and  epithelial  tissue  specimens  with  annotated 
clinical  information,  coupled  with  image  analysis  of  immunohistochemical  expression,  allowed 
us  to  characterize  the  expression  of  five  EMT  markers  in  a  large  set  of  tumor  and  epithelial 
specimens  comprising  the  entire  spectrum  of  lung  cancer  pathogenesis  and  progression. 

First,  we  demonstrated  that  adenocarcinoma  and  squamous  cell  carcinoma  had  high 
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levels  of  the  EMT  phenotype  compared  with  normal  bronchial  epithelium.  This  phenotype  is 
characterized  by  reduced  expression  of  E-cadherin  and  overexpression  of  N-cadherin,  integrin- 
av(36,  MMP-9,  and  vimentin.  The  differential  expression  of  the  EMT  phenotype  between  normal 
and  tumor  tissues  demonstrates  that  EMT  is  associated  with  NSCLC  tumor  development.  The 
reduced  expression  of  E-cadherin  (10,  17,  18)  and  overexpression  of  N-cadherin  (19),  integrin- 
av(36  (13),  and  MMP-9  (20-22)  has  been  previously  reported  in  NSCLC  tissue  specimens. 
Vimentin  is  considered  a  hallmark  marker  for  the  mesenchymal  differentiation  of  cells,  and  its 
expression  in  NSCLC  cell  lines  has  been  previously  associated  with  other  EMT-related  changes 
(9).  However,  to  our  knowledge,  our  study  is  the  first  to  report  the  expression  of  multiple  EMT 
markers  in  a  large  set  of  NSCLCs  using  image  analysis  assessment  of  immunostaining. 

Tumors  are  defined  by  their  invasive  and  metastatic  potential.  This  potential  may  be 
achieved  by  tumor  cells  via  EMT.  As  with  the  developmental  EMT  process,  multiple  EMT- 
related  histologic,  molecular,  and  transcriptional  changes  occur  during  carcinoma  progression 
(23).  This  process  is  characterized  by  morphologic  modifications,  including  epithelial  polarized 
cells  acquiring  a  motile  non-polarized  appearance,  and  molecular  changes,  including  altered 
expression  of  growth  and  transcription  factors  and  modified  expression  of  cytoskeletal  and 
adhesion  molecules  (6).  The  EMT  process  has  been  described  extensively  for  several  types  of 
cancer,  particularly  colon,  breast,  and  prostate  cancer  (8,  24-27).  In  lung  cancer,  the  loss  or 
switch  of  adhesion  molecules  (10,  28)  and  the  expression  of  classically  mesenchymal  proteins 
have  been  associated  with  the  induction  of  EMT  and  carcinoma  aggressiveness  (9).  In  addition, 
the  expression  of  several  growth  factors  and  their  effectors  (29),  transcription  factors  (30,  31), 
and  proteases  (32,  33)  have  been  associated  with  EMT. 

Loss  of  E-cadherin  membrane  expression  and  the  subsequent  reduction  of  cells’  ability  to 
form  stable  cell-cell  contacts  is  a  hallmarks  of  EMT  (6).  In  NSCLC,  E-cadherin  downregulation 
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occurs  mostly  via  epigenetic  mechanisms,  including  gene  promoter  hypermethylation  (34).  N- 
cadherin  is  mainly  expressed  in  nervous  system  and  mesenchymal  normal  cells.  In  epithelial 
tumors,  increased  N-cadherin  immunohistochemical  expression  (often  de  novo  expression)  is 
accompanied  by  E-cadherin  downregulation  (35).  In  NSCLC,  increased  N-cadherin  membrane 
and  cytoplasm  expression  has  been  previously  described  in  approximately  one-third  of  tumors 
(19).  Integrin-av(36,  an  integrin  transmembrane  glycoprotein,  is  integrated  across  the  plasma 
membrane  and  provides  a  link  between  the  extracellular  matrix  and  cytoskeletal  molecules  (36). 
Integrin-av(36  overexpression  has  been  found  in  other  epithelial  tumors,  including  colorectal 
(24),  ovarian  (37),  and  head  and  neck  carcinomas  (38).  Of  interest,  integrin-av(36  was  recently 
identified  by  biopanning  peptides  in  NSCLC  cell  lines  (13),  and  we  have  reported 
immunohistochemistry  TMA  data  that  validate  this  method  (13).  MMPs  are  part  of  the 
proteolytic  cascade  that  degrades  the  extracellular  matrix  and  allows  the  migration  of  tumor  and 
endothelial  cells  (39).  MMP-9  is  a  gelatinase  capable  of  forming  gaps  in  the  basement  membrane 
to  facilitate  invasion  and  metastasis  (39).  High  levels  of  MMP-9  immunohistochemical 
expression  have  been  reported  in  nearly  60%  of  lung  tumors  (20,  22,  40). 

Our  findings  indicate  that  the  EMT  phenotype  is  homogeneously  present  in  NSCLC;  this 
phenotype  is  associated  with  relatively  few  clinicopathologic  features,  including  tumor  histologic 
characteristics  and  patients’  smoking  status.  Squamous  cell  carcinoma  had  reduced  E-cadherin 
expression  and  increased  N-cadherin  cytoplasmic  expression  compared  with  adenocarcinoma 
histology.  To  our  knowledge,  only  reduced  E-cadherin  expression  in  lung  squamous  cell 
carcinoma  histology  has  been  previously  reported  (41,  42).  Interestingly,  in  our  study,  the  N- 
cadherin  membrane  and  cytoplasm  scores  were  significantly  higher  in  ever-smokers  than  in 
never-smokers.  The  biologic  reason  for  this  phenomenon  is  unknown.  Several  studies  have 
consistently  found  an  association  between  loss  of  E-cadherin  expression  and  poor  prognosis  in 
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NSCLC  patients  treated  with  surgery  with  curative  intent  (10,  17,  28,  42).  To  our  knowledge, 
only  one  study  showed  that  N-cadherin  overexpression  was  associated  with  poor  outcome  in 
NSCLC  patients  (in  a  univariate  analysis)  (19).  We  were  not  able  to  replicate  those  results, 
which  could  be  due  to  our  use  of  different  immunohistochemical  methods,  TMAs  instead  of 
whole  tissue  sections,  or  image  analysis  instead  of  microscope  observation  for  assessment  of 
immunostaining.  MMP-9  immunohistochemical  expression  has  been  found  to  be  associated  with 
prognosis  in  some  studies  (20,  40,  43)  but  not  in  others  (22,  43).  We  had  already  reported  that 
integrin-av(36  immunohistochemical  overexpression  was  associated  with  poor  overall  survival  in 
NSCLC  patients  (13). 

Although  EMT  has  been  associated  with  activation  of  the  EGFR  pathway  (9,  21,  40,  44, 
45),  we  found  no  relationship  between  the  expression  of  EMT  markers  and  EGFR 
immunohistochemical  expression  or  the  EGFR  mutation  status  of  tumors.  Of  interest,  an  EMT 
gene  expression  signature  was  identified  in  NSCLC  cell  lines  that  are  sensitive  to  in  vitro 
exposure  to  the  EGFR  TKI  erlotinib  (9).  Moreover,  the  re-expression  of  E-cadherin  has  been 
shown  to  increase  the  sensitivity  of  NSCLC  cell  lines  to  EGFR  TKIs  (46).  Although  a  few 
studies  have  shown  that  E-cadherin  overexpression  of  NSCLC  tumors  determines  sensitivity  to 
EGFR  TKI  (9),  no  association  was  been  found  between  EGFR  mutation  and  E-cadherin 
expression  in  one  study  (47). 

Lung  cancer  is  believed  to  develop  from  a  series  of  preneoplastic  lesions  in  the 
respiratory  mucosa,  and  these  abnormalities  are  frequently  extensive  and  multifocal  throughout 
the  respiratory  epithelium,  indicating  a  field  effect  or  field  cancerization  phenomenon  (5).  These 
histopathologic  changes  have  been  well  defined  for  squamous  cell  carcinoma,  but  they  are  poorly 
defined  for  adenocarcinoma  (5).  Mucosal  changes  in  the  large  airways  that  may  precede  invasive 
squamous  cell  carcinoma  include  squamous  dysplasia  and  carcinoma  in  situ  in  the  central 
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bronchial  airway  (5).  We  found  increased  EMT  phenotype  levels  (especially  reduced  E-cadherin 
and  increased  N-cadherin  and  integrin-av(36  expression),  and  these  levels  corresponded  with 
increased  histopathologic  severity.  These  findings  indicate  that  EMT  occurs  early  in  the 
pathogenesis  of  lung  cancer.  To  our  knowledge,  only  reduced  E-cadherin  expression  has  been 
previously  described  by  us  (48)  and  others  (49)  to  commence  at  the  squamous  dysplasia  stage.  Of 
the  five  markers  studied,  membrane  integrin-av(36  had  the  clearest  stepwise  increase  in 
expression,  which  corresponded  to  histopathologic  severity  (from  normal  and  reactive  bronchial 
epithelia  to  squamous  dysplasia  and  invasive  squamous  cell  carcinoma).  Thus,  integrin-ocvp6 
may  be  a  novel  marker  for  monitoring  the  progression  of  centrally  located  preneoplastic  lesions 
in  smokers. 

Because  of  their  overexpression  in  malignant  cells  and  their  role  in  cell  survival,  motility, 
and  invasion,  several  EMT -related  proteins,  including  N-cadherin  and  integrins,  have  been 
considered  as  therapeutic  targets  in  cancer  (36,  50).  However,  the  expression  of  these  proteins  in 
the  spectrum  of  advanced  NSCLC,  including  metastasis  sites,  has  not  been  previously  studied.  In 
addition,  to  our  knowledge,  this  is  the  first  study  to  assess  the  expression  of  EMT  markers  in 
lung  cancer  tissue  specimens  obtained  from  metastasis  sites.  It  has  been  suggested  that  the 
progression  of  epithelial  tumors  involves  spatial  and  temporal  occurrences  of  EMT,  whereby 
tumor  cells  acquire  a  more  invasive  and  metastatic  phenotype  (6,  8).  The  disseminated 
mesenchymal  tumor  cells  must  then  undergo  MET  at  the  site  of  metastasis  because  metastasis 
recapitulates  the  pathologic  characteristics  of  the  cells’  corresponding  primary  tumors. 
Interestingly,  we  found  lower  EMT  phenotype  levels  in  lung  cancer  brain  metastases  than  in 
primary  tumors,  confirming  that  cellular  plasticity,  which  allows  cells  to  undergo  EMT  and  MET 
in  the  appropriate  microenvironments,  is  important  in  metastasis.  Our  finding  of  different  levels 
of  the  EMT  phenotype  in  NSCLC  brain  metastases  than  in  primary  tumors  emphasize  the 
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importance  of  characterizing  and  understanding  the  molecular  events  in  lung  cancer  metastasis. 

In  summary,  to  our  knowledge,  this  study  is  the  first  to  comprehensively  evaluate  the 
expression  of  five  EMT-related  proteins  throughout  the  early  development  and  progression  of 
NSCLC.  We  demonstrated  that  the  EMT  phenotype  is  frequently  expressed  in  primary  squamous 
cell  carcinoma  and  adenocarcinoma  of  the  lung;  EMT  is  an  early  phenomenon  in  the 
pathogenesis  of  squamous  cell  carcinoma,  and  NSCLC  brain  metastases  show  characteristics  of 
MET.  Our  findings  suggest  that  EMT  is  a  useful  target  for  lung  cancer  chemoprevention  and 
therapy. 
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Figure  Legends 

Figure  1.  Representative  microphotographs  of  immunohistochemical  expression  of  EMT  markers 
in  tissue  specimens  of  primary  NSCLC  adenocarcinoma  and  squamous  cell  carcinoma.  Tumor 
cells  show  membrane  immunostaining  for  E-cadherin  and  integrin-av(36,  membrane  and 
cytoplasm  staining  for  N-cadherin,  and  cytoplasm  staining  for  vimentin  and  MMP-9. 

Figure  2.  Scores  of  membrane  and  cytoplasm  immunohistochemical  expression  of  EMT  markers 
in  normal  and  reactive  epithelia  compared  with  those  in  primary  tumors.  The  number  of  samples 
is  indicated  for  each  histologic  group  and  marker.  P  values  comparing  normal  epithelial  and 
tumor  histologic  types  are  shown  for  all  comparisons. 

Figure  3.  Scores  for  membrane  immunohistochemical  expression  of  E-cadherin,  N-cadherin,  and 
integrin-av(36  in  bronchial  respiratory  epithelial  lesions  related  to  the  pathogenesis  of  squamous 
cell  carcinoma  of  the  lung:  normal  and  reactive  epithelium,  squamous  preneoplasia  (dysplasias 
and  carcinoma  in  situ),  and  primary  squamous  cell  carcinoma.  The  number  of  samples  is 
indicated  for  each  histologic  group  and  marker.  Significant  P  values  for  comparisons  between 
each  group  are  shown.  For  each  marker,  representative  microphotographs  of 
immunohistochemical  expression  in  tissue  specimens  of  bronchial  epithelium  with  normal, 
squamous  dysplasia,  and  carcinoma  in  situ  are  shown.  For  E-cadherin,  decreased  membrane 
immunostaining  was  found  with  increased  histologic  severity.  The  opposite  phenomenon  is 
shown  for  cytoplasm  N-cadherin  and  membrane  integrin-avp6  expression. 
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Table  1.  Demographic  and  clinicopathologic  data  by  histologic  tumor  type 


Characteristic 

Total 

(N=325) 

Adenocarcinoma 

(N=209) 

Squamous  cell 
carcinoma  (N=116) 

Sex 

Female 

173 

128 

45 

Male 

152 

81 

71 

Stage 

I 

205 

139 

66 

II 

62 

28 

34 

III 

49 

35 

14 

IV 

9 

7 

2 

Smoking  history1 

Yes 

263 

152 

111 

No 

61 

57 

4 

Smoking  status1 

Never 

61 

57 

4 

Former 

161 

92 

69 

Current 

102 

60 

42 

Smoking  history  and  status  was  not  available  for  one  squamous  cell  carcinoma  patient. 
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Table  2.  Immunohistochemical  expression  of  EMT  markers  by  NSCLC  histologic  type 


Marker 

Adenocarcinoma 

Squamous  cell 
carcinoma 

P  value 

Total 

E-cadherin 

Membrane  class 

N 

191 

110 

301 

0  or  1  (%) 

108  (57) 

80  (73) 

0.005 

188  (62) 

2  or  3  (%) 

83  (43) 

30  (27) 

113  (38) 

N-cadherin 

Membrane  class 

N 

193 

106 

0.73 

299 

0  or  1  (%) 

60  (31) 

35  (33) 

95  (32) 

2  or  3  (%) 

133  (69) 

71 (67) 

204  (68) 

Mean  cytoplasm  score 

46.4  (4.2) 

47.8  (2.4) 

0.002 

46.9  (3.7) 

(SD) 

Integrin-avp6 

Membrane  class 

N 

177 

106 

0.89 

283 

0  or  1  (%) 

77  (44) 

47  (44) 

124  (44) 

2  or  3  (%) 

100  (56) 

59  (56) 

159  (56) 

Vimentin 

Cytoplasm  score 

N 

172 

90 

262 

Mean  (SD) 

62.1  (92.6) 

42.2  (72.4) 

0.15 

55.3  (86.6) 

MMP-9 

Cytoplasm  score 

N 

179 

96 

275 

Mean  (SD) 

44.8  (9.9) 

44.9  (8.4) 

0.95 

44.8  (9.4) 
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Table  3.  Immunohistochemical  expression  of  EMT  markers  in  primary  tumors  and  brain  metastases  by  histologic  type 


Adenocarcinoma _ Squamous  cell  carcinoma 


Marker 

Primary  tumor 

Metastasis 

P  value 

Primary  tumor 

Metastasis 

P  value 

E-cadherin 

Membrane  class 

N 

Mean  (SD) 

191 

1.5  (0.6) 

35 

1.6  (0.7) 

0.16 

110 

1.3  (0.5) 

11 

1.6  (0.8) 

0.02 

N-cadherin 
Membrane  class 

N 

Mean  (SD) 

193 

1.7  (0.6) 

37 

1.2  (0.9) 

<0.0002 

106 

1.7  (0.5) 

11 

1  (0.4) 

<0.0002 

Cytoplasm  score 
Mean  (SD) 

46.4  (4.2) 

20.5  (20.1) 

<0.0002 

47.8  (2.4) 

28.6  (20.7) 

<0.0002 

Integrin-avp6 
Membrane  class 

N 

Mean  (SD) 

177 

1.8  (0.9) 

32 

1.2  (0.9) 

<0.0002 

106 

1.8  (0.9) 

11 

1.5  (0.9) 

0.04 

Vimentin 

Cytoplasm  score 

N 

Mean  (SD) 

172 

62.1  (92.6) 

32 

61.3(101.3) 

0.9 

90 

42.2  (72.4) 

10 

15  (30.6) 

0.04 

MMP-9 

Cytoplasm  score 

N 

Mean  (SD) 

179 

44.8  (9.9) 

31 

47.1  (2.9) 

0.07 

96 

44.9  (8.4) 

8 

47.4(1.6) 

0.12 
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Abstract 

Lung  cancer  is  the  leading  cause  of  cancer  deaths  worldwide. 
Recent  advances  in  targeted  therapies  hold  promise  for  the 
development  of  new  treatments  for  certain  subsets  of  cancer 
patients  by  targeting  specific  signaling  molecule.  Based  on  the 
identification  of  the  transcription  factor  cyclic  AMP  response 
element-binding  protein  (CREB)  as  an  important  regulator  of 
growth  of  several  types  of  cancers  and  our  recent  findings  of 
its  importance  in  normal  differentiation  of  bronchial  epithe¬ 
lial  cells,  we  hypothesized  that  CREB  plays  an  important 
pathobiologic  role  in  lung  carcinogenesis.  We  conducted  this 
initial  study  to  determine  whether  the  expression  and 
activation  status  of  CREB  are  altered  in  non- small  cell  lung 
cancer  (NSCLC)  and  of  any  prognostic  importance  in  NSCLC 
patients.  We  found  that  the  expression  levels  of  mRNA  and 
protein  of  CREB  and  phosphorylated  CREB  (p-CREB)  were 
significantly  higher  in  most  of  the  NSCLC  cell  lines  and  tumor 
specimens  than  in  the  normal  human  tracheobronchial 
epithelial  cells  and  adjacent  normal  lung  tissue,  respectively. 
Analysis  of  CREB  mRNA  expression  and  the  CREB  gene  copy 
number  showed  that  CREB  overexpression  occurred  mainly  at 
the  transcriptional  level.  Immunohistochemical  analysis  of 
tissue  microarray  slides  containing  sections  of  NSCLC  speci¬ 
mens  obtained  from  310  patients  showed  that  a  decreased 
survival  duration  was  significantly  associated  with  over¬ 
expression  of  CREB  or  p-CREB  in  never  smokers  but  not  in 
current  or  former  smokers  with  NSCLC.  These  are  the  first 
reported  results  illustrating  the  potential  of  CREB  as  a 
molecular  target  for  the  prevention  and  treatment  of  NSCLC, 
especially  in  never  smokers.  [Cancer  Res  2008;68(15):6065-73] 

Introduction 

Lung  cancer  is  the  leading  cause  of  cancer  deaths,  and  its 
incidence  is  rising  (1).  In  the  United  States,  215,020  new  cases  and 
161,840  deaths  of  lung  and  bronchial  cancer  are  projected  to  occur 
in  2008  (2).  Non-small  cell  lung  cancer  (NSCLC)  accounts  for 
~  80%  of  all  lung  cancers  and  is  subdivided  into  adenocarcinoma, 
squamous  cell  carcinoma,  and  large  cell  carcinoma  (3).  Recent 


Note:  Supplementary  data  for  this  article  are  available  at  Cancer  Research  Online 
(http:/ / cancerres.aacrjournals.org/). 

Current  address  for  M-K.  Kim:  Department  of  Pathology,  Chung-Ang  University 
Medical  Center,  Seoul,  South  Korea. 

Requests  for  reprints:  Ja  Seok  Koo,  Department  of  Thoracic/Head  and  Neck 
Medical  Oncology,  The  University  of  Texas  M.  D.  Anderson  Cancer  Center,  Unit  432, 
1515  Holcombe  Boulevard,  Houston,  TX  77030.  Phone:  713-792-8454;  Fax:  713-794- 
5997;  E-mail:  jskoo@mdanderson.org. 

©2008  American  Association  for  Cancer  Research. 

doi:10. 1 158/ 0008-5472.C  AN-07-537 6 


advances  in  targeted  therapies  directed  against  the  epidermal 
growth  factor  receptor  (EGFR)  and  vascular  endothelial  growth 
factor  receptor  pathways  showed  marked  improvement  in 
treatment  in  a  subset  of  patients  with  lung  cancer  (4-6).  Thus, 
identifying  new  molecular  targets  for  treatment  and/or  prevention 
of  NSCLC  is  warranted  and  urgently  needed  to  improve  the  control 
of  this  deadly  form  of  lung  cancer. 

Studies  have  shown  that  cyclic  AMP  (cAMP)  response  element¬ 
binding  protein  (CREB)  plays  important  roles  in  cell  differentiation 
(7),  survival  (8,  9),  proliferation  (10),  development  (11, 12),  cell  cycle 
progression  (10),  and  glucose  metabolism  (13).  CREB  is  activated 
by  cAMP,  growth  factors,  hormones,  retinoids  (14),  cytokines  (15), 
and  prostaglandins  (16)  via  multiple  signaling  pathways  (13,  17), 
including  the  cAMP/protein  kinase  A,  phosphatidylinositol  3- 
kinase  (PI3K)/Akt,  extracellular  signal-regulated  kinase  (ERK)/p90 
ribosomal  S6  kinase,  and  p38/mitogen-  and  stress-activated  protein 
kinase  pathways.  Once  activated,  CREB  induces  the  expression  of 
cAMP  response  element-containing  target  genes  (13,  18,  19),  which 
play  important  roles  in  differentiation  (14),  cell  cycle  progression 
(20),  apoptosis  suppression  (21),  proliferation  (22),  neovasculariza¬ 
tion  (23),  inflammation  (24),  and  tumorigenesis  (25). 

Recently,  we  showed  that  CREB  plays  a  physiologic  role  in 
mucous  differentiation  of  normal  human  tracheobronchial  epithe¬ 
lial  (NHTBE)  cells  (14,  26).  Previous  studies  found  that  CREB  has  a 
pathobiologic  role  in  the  growth  of  breast  cancer,  melanoma,  and 
hepatocellular  carcinoma  cells  (7,  27,  28).  Recent  studies  also 
showed  that  CREB  acts  as  a  proto-oncogene  to  regulate 
hematopoiesis  and  to  contribute  to  the  leukemia  phenotype 
(29-31).  However,  whether  CREB  expression  is  altered  in  human 
NSCLC  tumors  and  whether  CREB/phosphorylated  CREB  (p-CREB) 
expression  correlates  with  the  survival  rate  in  patients  with  NSCLC 
have  not  been  previously  shown. 

Based  on  these  previous  findings,  we  hypothesized  that  CREB, 
which  plays  an  important  role  in  normal  differentiation  of 
bronchial  epithelial  cells,  may  also  have  an  important  pathobio¬ 
logic  role  in  lung  carcinogenesis  as  a  transcriptional  regulatory 
factor.  To  test  this  hypothesis,  we  compared  the  CREB  expression 
levels  and  activation  statuses  and  the  CREB  gene  copy  numbers  in 
10  NSCLC  cell  lines,  NHTBE  cells,  6  frozen  human  NSCLC  tissue 
specimens,  and  paired  normal  lung  tissue  specimens.  We  also 
analyzed  CREB  and  p-CREB  expression  in  45  paraffin-embedded 
whole  specimens  of  NSCLC  tumor  and  adjacent  normal  bronchial 
or  bronchiolar  epithelial  tissue  specimens.  Lastly,  we  assessed  the 
levels  of  CREB  and  p-CREB  expression  in  association  with 
clinicopathologic  variables  and  overall  survival  duration  of  310 
NSCLC  patients  with  banked  NSCLC  tissue  specimens  using  tissue 
microarray  (TMA)  analysis.  Although  studies  have  consistently 
shown  that  smoking  is  an  important  etiologic  factor  for  lung 
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cancer,  about  15%  of  men  and  53%  of  women  with  this  disease 
worldwide  are  never  smokers  (32).  Thus,  we  also  analyzed  the 
effects  of  CREB  and  p-CREB  expression  on  overall  survival  duration 
in  patients  with  NSCLC  according  to  smoking  status. 

Materials  and  Methods 

NSCLC  tissue  specimens  and  TMA  construction.  Six  frozen  tumor 
tissue  specimens  (three  squamous  cell  carcinoma  and  three  adenocarcino¬ 
ma)  and  six  adjacent  normal  lung  tissue  specimens  surgically  resected  from 
patients  who  underwent  lobectomies  or  pneumonectomies  for  primary 
NSCLC  were  obtained  from  the  tissue  bank  of  The  University  of  Texas  M.  D. 
Anderson  Cancer  Center  Specialized  Program  of  Research  Excellence  in 
Lung  Cancer.  All  of  the  tumors  were  histologically  examined  and  classified 
using  the  2004  WHO  International  Classification  of  Lung  Tumors  (33).  In 
addition,  specimens  of  tumor  and  adjacent  normal  lung  tissue  (including 
bronchial  and  bronchiolar  epithelia)  obtained  from  45  patients  with 


surgically  resected  NSCLC  (26  adenocarcinoma  and  19  squamous  cell 
carcinoma)  were  randomly  selected  for  assessment  of  immunohistochem- 
ical  expression  of  CREB  and  p-CREB  in  whole  histologic  sections.  After 
histologic  examination  of  310  NSCLC  specimens  [194  adenocarcinoma  or 
bronchioloalveolar  carcinoma  (BAC)  and  116  squamous  cell  carcinoma]  in 
the  tissue  bank  of  The  University  of  Texas  M.  D.  Anderson  Cancer  Center, 
tumor  TMAs  were  constructed  using  three  tissue  cores  per  tumor  that  were 
1  mm  in  diameter  to  obtain  tissue  from  central,  intermediate,  and 
peripheral  tumor  areas.  The  M.  D.  Anderson  Cancer  Center  Institutional 
Review  Board  approved  the  use  of  the  archived  clinical  tissue  specimens. 

Smoking  history.  Patients  who  had  smoked  at  least  100  cigarettes  in 
their  lifetime  were  defined  as  ever  smokers,  and  patients  who  quit  smoking 
at  least  12  mo  before  their  lung  cancer  diagnosis  were  defined  as  former 
smokers.  Current  smokers  were  defined  as  active  smokers  who  had  been 
smoking  for  at  least  6  mo.  Subjects  were  asked  whether  they  had  ever 
smoked  any  tobacco  products  (nonfiltered  or  filtered  cigarettes,  cigars,  or 
pipes)  for  at  least  6  mo  and  were  classified  as  never  smokers  if  they  had  not, 
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Figure  1 .  Expression  of  CREB  and  p-CREB  in  NHTBE  and  NSCLC  cells.  A,  Western  blot  analysis  of  CREB  and  p-CREB  expression.  Whole-cell  lysates  were  prepared 
from  fully  differentiated  NHTBE  cells  ( lane  1)  and  the  10  NSCLC  cell  lines  ( lane  2,  H226;  lane  3,  H292;  lane  4,  H520;  lane  5,  H2170;  lane  6,  HI 563;  lane  7, 

HI 734;  lane  8,  HI 975;  lane  9,  H2228;  lane  10,  A549;  lane  11 ,  HI 703)  grown  in  optimal  medium  up  to  confluence.  Equal  amounts  of  lysate  were  subjected  to 
SDS-PAGE  and  Western  blotting.  The  blots  were  probed  with  anti-CREB  and  anti-p-CREB  antibodies.  Equal  protein  loading  was  confirmed  by  stripping  the  blots 
and  reprobing  them  with  an  anti-p-actin  antibody.  The  expression  levels  of  CREB  and  p-CREB  proteins  for  each  cell  line  in  relation  to  the  NHTBE  cells  were 
quantitated.  B,  qRT-PCR  analysis  of  CREB  mRNA  expression.  The  CREB  mRNA  level  in  each  cell  line  was  analyzed  with  qRT-PCR.  The  values  shown  are  the 
ratios  of  the  CREB  mRNA  expressed  in  NSCLC  cells  to  that  expressed  in  NHTBE  cells,  with  CREB  mRNA  levels  normalized  against  the  GAPDH  mRNA  level. 

The  results  shown  are  from  a  representative  experiment  performed  twice,  each  run  in  triplicate.  Columns,  mean;  bars,  SE.  *,  P  <  0.05;  **,  P  <  0.01  versus 
NHTBE  cells  ( lane  1)  with  Student’s  t  test.  C,  CREB  gene  copy  number  analysis  with  PCR.  The  genomic  DNA  obtained  from  each  cell  line  was  subjected  to 
PCR.  The  results  shown  are  from  a  representative  experiment  performed  twice,  each  run  in  triplicate.  The  values  are  the  ratios  of  the  CREB  DNA  copy  number 
in  NSCLC  cells  to  that  in  NHTBE  cells,  with  CREB  DNA  normalized  against  the  p-actin  DNA  level.  Columns,  mean;  bars,  SE.  *,  P  <  0.05;  **,  P  <  0.01  versus 
NHTBE  cells  ( lane  1)  with  Student’s  t  test. 
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Figure  2.  Expression  of  CREB  and  p-CREB  in  frozen  human  NSCLC  specimens  and  adjacent  normal  bronchial  and  bronchiolar  epithelial  tissue  specimens.  A, 
Western  blot  analysis  of  CREB  and  p-CREB  expression.  Soluble  proteins  obtained  from  three  squamous  cell  carcinoma  ( Sq )  and  three  adenocarcinoma  (Ad) 
tissue  specimens  ( T)  and  paired  matching  normal  tissue  specimens  (A/)  were  subjected  to  Western  blot  analysis  for  the  levels  of  total  CREB  and  p-CREB  expression. 
Equal  protein  loading  was  confirmed  by  stripping  the  blots  and  reprobing  them  with  an  anti-p-actin  antibody.  The  expression  levels  of  CREB  and  p-CREB  proteins 
in  tissue  specimens  in  relation  to  that  of  the  NHTBE  cells  ( hatched  columns)  were  quantitated.  B,  qRT-PCR  analysis  of  CREB  mRNA  expression.  Total  mRNA 
from  the  NSCLC  tissue  specimens  and  paired  normal  tissue  specimens  described  in  A  was  subjected  to  qRT-PCR.  The  values  shown  are  the  ratios  of  the  CREB 
mRNA  expressed  in  tissue  specimens  to  that  expressed  in  NHTBE  cells,  with  CREB  mRNA  levels  normalized  against  the  GAPDH  mRNA  level.  The  results  are 
from  a  representative  experiment  performed  twice,  and  samples  were  run  in  triplicate.  Columns,  mean;  bars,  SE.  *,  P  <  0.05;  **,  P  <  0.01  versus  matched 
normal  tissue  with  Student’s  t  test.  C,  PCR  analysis  of  CREB  gene  copy  number.  The  genomic  DNA  extracted  from  the  NSCLC  tissue  specimens  and  paired 
normal  tissue  specimens  was  subjected  to  quantitative  PCR.  The  values  are  the  ratios  of  the  CREB  DNA  copy  numbers  in  normal  and  tumor  tissues  to  the  CREB 
DNA  copy  numbers  in  NHTBE  cells,  with  CREB  DNA  normalized  against  the  p-actin  DNA  level.  The  results  are  from  a  representative  experiment  performed  twice, 
and  samples  were  run  in  triplicate.  Columns,  mean;  bars,  SE.  *,  P  <  0.05;  **,  P  <  0.01  versus  matched  normal  tissue  with  Student’s  t  test. 


as  current  smokers  if  they  smoked  daily  at  the  date  of  diagnosis  (for 
patients)  or  interview  (for  controls),  or  as  former  smokers  if  they  had 
stopped  smoking  daily  before  those  dates. 

Cell  cultures.  NHTBE  cells  obtained  from  Cambrex  were  organotypically 
cultured  and  maintained  as  described  previously  (34-37).  Basically,  NHTBE 
cells  from  passage  2  were  seeded  at  a  density  of  1  x  105  per  insert  onto 
24-mm,  uncoated,  semipermeable  membranes  (Transwell  clear;  Costar)  in  a 
1:1  mixture  of  DMEM  (Invitrogen  Co.)  and  bronchial  epithelial  cell  basal 
medium  (Cambrex)  supplemented  with  transferrin  (10  ng/mL),  epinephrine 
(0.5  pg/mL),  insulin  (5  pg/mL),  triiodothyronine  (6.5  ng/mL),  hydrocorti¬ 
sone  (0.5  pg/mL),  EGF  (0.5  ng/mL),  bovine  pituitary  extract  (1%  w/v), 
bovine  serum  albumin  (1.5  pg/mL),  gentamicin  (10  pg/mL),  and  retinoic 
acid  (5  x  10-8  mol/L).  The  cells  were  grown  submerged  in  the  medium  for 
the  first  7  d,  after  which  time  an  air-liquid  interface  was  created.  The  cells 
were  then  cultured  in  the  air-liquid  interface  for  3  wk,  with  the  medium 
changed  every  24  h.  Fully  differentiated  28-d-old  cultures  developed 
mucociliary  phenotypes  similar  to  that  of  in  vivo  bronchial  epithelium.  In 
addition,  10  NSCLC  cell  lines  (H226,  H292,  H520,  H2170,  H1563,  H1734, 
H1975,  H2228,  A549,  and  H1703)  were  obtained  from  the  American  Type 
Culture  Collection  and  maintained  in  RPMI  1640  containing  10%  fetal 
bovine  serum  and  gentamicin  (10  pg/mL). 


Western  blot  analysis.  Western  blot  analysis  was  performed  as 
described  previously  (14)  to  measure  the  expression  of  CREB  and  p-CREB 
in  whole-cell  extracts  from  the  NHTBE  cells  and  NSCLC  cell  lines  and  from 
the  six  archived  NSCLC  specimens  and  paired  normal  lung  tissue 
specimens. 

CREB  mRNA  expression  and  CREB  gene  copy  number.  Total  RNA 
and  genomic  DNA  from  the  NHTBE  cells,  NSCLC  cells,  and  frozen  NSCLC 
specimens  were  extracted  using  the  Qiagen  RNeasy  Mini  kit  and  Blood  &  Cell 
Culture  DNA  Mini  kit  (Qiagen)  and  subjected  to  quantitative  reverse 
transcription-PCR  (qRT-PCR)  and  conventional  PCR  analysis  to  determine 
CREB  mRNA  expression  and  CREB  gene  copy  number,  as  described 
previously  (14).  The  primer  sequences  used  for  detection  of  CREB  mRNA 
in  qRT-PCR  were  as  follows:  forward,  5'-ACTGTAACGGTGCCAACTCC-3'; 
reverse,  5'-GAATGGTAGTACCCGGCTGA-3'.  The  mRNA  level  of  human 
glyceraldehyde-3-phosphate  dehydrogenase  (GAPDH)  detected  with  VIC  dye 
(Applied  Biosystems)  was  used  as  endogenous  control.  The  primer  sequences 
used  for  determination  of  the  CREB  gene  copy  number  in  conventional  PCR 
were  as  follows:  forward,  5'- AAGAGGAGACTTC AGC AC CTG-3';  reverse, 
5'-GCAAAACTGAGAAGACTTGGC-3'.  For  endogenous  control,  the  following 
p-actin  primer  sequences  were  used:  forward,  5'-AGGTCATCACCATTGG- 
CAAT-3';  reverse,  5'-AATGAGGGCAGGACTTAGCTT-3'. 
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Figure  3.  Immunohistochemical  analysis  of  CREB  and  p-CREB  expression  in  normal  and  tumor  lung  tissues.  A,  fixed  NSCLC  tissue  specimens  (26  adenocarcinoma 
and  19  squamous  cell  carcinoma)  and  adjacent  normal  bronchial  and  bronchiolar  epithelial  tissue  specimens  were  subjected  to  immunohistochemical  staining  and 
then  scored  according  to  the  criteria  mentioned  in  Materials  and  Methods.  Left,  BLiP  plots  displaying  the  distribution  of  CREB  and  p-CREB  immunostaining 
scores  in  normal  and  tumor  tissues.  Average  scores  for  CREB  are  1.37  in  tumor  versus  0.73  in  normal  (P  =  0.013)  and  for  p-CREB  are  1.96  in  tumor  versus 
1.05  in  normal  (P  =  0.0002).  Representative  images  (right)  were  captured  at  a  magnification  of  x200.  Arrows,  nuclear  CREB  and  p-CREB  immunostaining  in  the 
basal  layer  of  normal  bronchial  epithelial  and  tumor  tissue  specimens.  B,  CREB  and  p-CREB  immunostaining  of  TMA  specimens  from  a  total  of  310  patients  with 
either  adenocarcinoma  (194)  or  squamous  cell  carcinoma  (116)  are  included  in  the  analysis.  Left,  the  distributions  of  the  staining  scores  for  the  two  histologic 
types  of  NSCLC  are  presented  in  the  box  plots.  The  number  of  samples  measured  was  indicated  under  each  category.  X  marks  and  lines  inside  the  quartile  boxes 
are  means  and  medians,  respectively  (for  CREB,  0.43  and  0.23  in  adenocarcinoma/BAC  versus  0.61  and  0.45  in  squamous  cell  carcinoma;  for  p-CREB,  0.25  and 
0.03  versus  0.32  and  0.2).  P  =  0.002  for  CREB;  P  =  0.008  for  p-CREB.  Right,  representative  images  of  CREB  and  p-CREB  immunostaining  of  TMA  NSCLC  specimens. 
Magnification,  x40. 
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Table  1.  Estimation  of  overall  survival  durations  per  demographic  and  clinicopathologic  characteristics  in 
from  whom  tumor  TMA  specimens  were  obtained 

NSCLC  patients 

Characteristic 

Variable  estimate  ±  SE 

P 

Hazards  ratio 

Univariate  Cox  proportional  hazards  regression  models 

Age 

0.04  ±  0.01 

0.0002 

1.04 

Sex  (male  vs  female) 

0.34  ±  0.21 

0.1000 

1.40 

Race  (white  vs  other) 

-0.31  ±  0.34 

0.3500 

0.73 

Smoking  status 

Former  vs  never 

0.24  ±  0.25 

0.3300 

1.27 

Current  vs  never 

0.03  ±  0.29 

0.9300 

1.03 

Histologic  subtype  (squamous  cell  carcinoma  vs  adenocarcinoma/BAC) 

0.65  ±  0.21 

0.0020 

1.92 

CREB  expression 

0.28  ±  0.17 

0.1000 

1.33 

p-CREB  expression 

0.66  ±  0.23 

0.0040 

1.93 

Pathologic  T  classification  (T2  +  T3  +  T4  vs  Tl) 

0.97  ±  0.27 

0.0002 

2.63 

Pathologic  N  classification  (N1  +  N2  +  N3  vs  NO) 

0.73  ±  0.23 

0.0010 

2.08 

Pathologic  M  classification  (Ml  vs  M0) 

1.05  ±  0.72 

0.1400 

2.87 

Pathologic  TNM  stage 

II  vs  I 

0.65  ±  0.26 

0.0100 

1.92 

III  +  IV  vs  I 

0.86  ±  0.29 

0.0030 

2.37 

Multivariate  Cox  proportional  hazards  regression  models 

Model  A:  CREB 

Age 

0.04  ±  0.01 

0.0002 

1.04 

CREB  expression 

0.41  ±  0.18 

0.0200 

1.51 

Pathologic  T  classification  (T2  +  T3  +  T4  vs  Tl) 

0.84  ±  0.28 

0.0030 

2.31 

Pathologic  N  classification  (N1  +  N2  +  N3  vs  NO) 

0.47  ±  0.24 

0.0490 

1.60 

Model  B:  p-CREB 

Age 

0.04  ±  0.01 

0.0005 

1.04 

p-CREB  expression 

0.59  ±  0.23 

0.0100 

1.80 

Pathologic  T  classification  (T2  +  T3  +  T4  vs  Tl) 

0.69  ±  0.28 

0.0100 

2.00 

Pathologic  N  classification  (N1  +  N2  +  N3  vs  NO) 

0.47  ±  0.24 

0.0500 

1.59 

Immunohistochemical  analysis.  Immunohistochemical  analysis  of  the 
NSCLC  and  adjacent  normal  bronchial  and  bronchiolar  epithelial  tissue 
specimens  and  of  the  TMA  NSCLC  tissue  specimens  was  performed  using 
anti-CREB  and  anti-p-CREB  antibodies  (Upstate)  at  a  dilution  of  1:100 
according  to  the  manufacturer’s  instructions.  Immunostaining  was 
visualized  using  the  Histostain-Bulk-SP  kit  and  the  AEC  red  substrate  kit 
(Zymed  Laboratories).  Immunohistochemical  staining  without  a  primary 
antibody  was  performed  as  a  negative  control.  Distinct  nuclear  immunos¬ 
taining  for  CREB  and  p-CREB  was  quantified  by  an  experienced  lung  cancer 
pathologist  (I.I.W.)  under  a  light  microscope.  The  observer  quantified 
immunohistochemical  expression  in  a  blindly  fashion  regarding  the  clinical 
features  of  the  cases  (38,  39).  In  each  specimen,  up  to  1,000  tumor  and 
epithelial  cells  were  examined  using  a  X20  magnification  objective.  The 
intensity  of  CREB  and  p-CREB  immunostaining  was  graded  on  a  scale  of  0 
to  3,  with  0  indicating  no  staining,  1  indicating  weak  staining,  2  indicating 
moderate  staining,  and  3  indicating  strong  staining.  The  extent  of  positive 
immunoreactivity  for  CREB  and  p-CREB  (0-1;  1  =  100%)  was  calculated  as 
the  percentage  of  cells  that  had  nuclear  staining  for  CREB  or  p-CREB. 
Scoring  of  immunohistochemical  staining  in  each  specimen  was  deter¬ 
mined  as  the  product  of  positive  immunostaining  intensity  (0-3)  and 
positive  immunoreactivity  extent  (0-1). 

Statistical  analysis.  A  mixed-effect  general  linear  model  was  used  to 
assess  the  differences  in  CREB  and  p-CREB  expression  in  the  normal  lung 
and  NSCLC  tissue  specimens.  The  Kruskal-Wallis  test  and  Wilcoxon  rank 
sum  test  were  used  to  assess  the  relationships  between  CREB  and  p-CREB 
expression  in  the  TMA  specimens  and  patients’  demographic  and 
clinicopathologic  characteristics.  Cox  proportional  hazards  regression 
models  were  used  to  assess  the  effect  of  the  CREB  and  p-CREB 
immunostaining  scores  on  overall  survival  duration  (time  from  surgery  to 
death  of  any  cause).  Survival  curves  were  determined  using  the  Kaplan- 
Meier  product  limit  estimates,  and  differences  in  probability  of  survival 
between  groups  were  assessed  statistically  using  the  log-rank  test.  The  need 


for  transformation  of  predictive  variables  in  the  Cox  proportional  hazards 
regression  models  was  assessed  using  martingale  residual  plots.  Predictive 
variables  with  P  values  of  <0.10  for  the  univariate  Cox  proportional  hazards 
regression  model  were  included  in  a  multivariate  model.  In  this  multivariate 
model,  backward  elimination  with  a  P  value  cutoff  of  0.05  was  used;  any 
previously  deleted  variables  were  then  allowed  to  reenter  the  final  model  if 
P  <  0.05. 

Results 

Higher  expression  of  CREB  and  p-CREB  in  NSCLC  cell  lines 
than  in  NHTBE  cells.  Western  blot  analysis  showed  that  most  of 
the  NSCLC  cell  lines  had  higher  levels  of  CREB  (9  of  10  cell  lines) 
and  p-CREB  (7  of  10  cell  lines)  protein  expression  than  did  the 
NHTBE  cells  (Fig.  L4).  qRT-PCR  analysis  showed  a  similar  pattern 
in  CREB  mRNA  expression:  about  2-fold  to  12-fold  higher  levels  in  6 
of  10  NSCLC  cell  lines  than  in  NHTBE  cells  ( P  <  0.05  in  4  cell  lines; 
P  <  0.01  in  2  cell  lines;  Fig.  IB)-,  these  6  cell  lines  also  had  the 
highest  p-CREB  protein  expression  levels.  Unlike  CREB  protein  and 
mRNA  expression,  PCR  analysis  showed  that  the  CREB  gene  copy 
number  was  significantly  increased  in  only  2  of  the  10  NSCLC  cell 
lines  (P  <  0.05  for  both  cell  lines;  Fig.  1C).  These  data  clearly  show 
that  CREB  was  overexpressed  and  highly  activated  in  most  of  the 
NSCLC  cell  lines.  Moreover,  CREB  overexpression  in  these  cell  lines 
mainly  resulted  from  increased  CREB  gene  transcription  rather 
than  an  amplified  CREB  gene  copy  number. 

Higher  expression  of  CREB  in  frozen  NSCLC  specimens  than 
in  adjacent  normal  lung  tissue  specimens.  CREB  protein  was 
overexpressed  (by  50-92%)  and  more  highly  activated  (by  18-119%) 
in  frozen  NSCLC  tissue  specimens  than  in  adjacent  normal  tissue 
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specimens  according  to  our  Western  blot  analysis  (Fig.  2A).  Among 
the  six  frozen  specimens,  qRT-PCR  analysis  showed  that  CREB 
mRNA  expression  was  significantly  higher  in  all  three  squamous 
cell  specimens  (by  73-95%)  and  in  two  of  the  three  adenocarcinoma 
specimens  (by  65%  and  73%)  than  in  the  adjacent  normal  tissue 
specimens  (P  <  0.05  for  all  five  specimens;  Fig.  2 B).  PCR  analysis 
revealed  that  only  one  squamous  cell  carcinoma  tissue  specimen 
and  one  adenocarcinoma  tissue  specimen  had  a  significantly 
increased  CREB  gene  copy  number  when  compared  with  normal 
tissue  specimens  ( P  <  0.05  for  both  pairs  of  specimens;  Fig.  2 C). 

The  overexpression  of  CREB  and  p-CREB  in  these  frozen  NSCLC 
tissue  specimens  was  concordant  with  our  results  in  the  NSCLC 
cell  lines.  Taken  together,  our  cell  line  and  frozen  tissue  specimen 
results  showed  that  CREB  overexpression  occurred  primarily  at  the 
gene  transcription  level  but  in  some  cases  could  have  resulted  from 
an  amplified  CREB  gene  copy  number. 

Higher  expression  of  CREB  in  paraffin-embedded  NSCLC 
specimens  than  in  adjacent  normal  lung  tissue  specimens. 


Immunohistochemical  analysis  of  CREB  and  p-CREB  expression  in 
the  45  whole  paraffin-embedded  NSCLC  specimens  and  adjacent 
normal  bronchial  and  bronchiolar  epithelial  tissue  specimens 
showed  stronger  nuclear  staining  for  CREB  and  p-CREB  in  both 
adenocarcinoma  and  squamous  cell  carcinoma  tissue  than  in 
normal  tissue  (Fig.  3 A).  The  distributions  of  the  staining  scores  for 
the  tumor  tissue  and  normal  epithelium  are  shown  in  Fig.  3 A. 
Statistical  analysis  showed  significantly  higher  immunostaining 
scores  for  both  CREB  (1.37  versus  0.73;  P  =  0.013)  and  p-CREB  (1.96 
versus  1.05;  P  =  0.0002)  in  tumor  tissue  than  in  normal  tissue. 

Association  of  CREB  and  p-CREB  expression  with  histologic 
NSCLC  subtype  and  patients’  demographic  and  clinicopatho- 
logic  characteristics.  We  performed  immunohistochemical  ana¬ 
lysis  of  CREB  and  p-CREB  expression  in  the  310  NSCLC  TMAs  to 
assess  potential  associations  of  CREB  and  p-CREB  overexpression 
with  the  patients’  demographic  and  clinicopathologic  character¬ 
istics  and  histologic  subtypes  of  NSCLC  (listed  in  Supplementary 
Table  SI).  Representative  images  of  the  staining  of  CREB  and 


Figure  4.  Kaplan-Meier  curves  of  overall  survival  duration  stratified  according  to  CREB  (top)  and  p-CREB  ( bottom )  expression  in  the  entire  cohort  (left)  and  in  never 
smokers  (right).  A  total  of  310  patients  had  information  available  for  the  survival  analysis,  94  of  whom  died.  The  median  overall  survival  duration  was  6.5  y,  and 
the  median  follow-up  duration  was  3.2  y.  The  cutoff  points  for  the  CREB  and  p-CREB  immunostaining  scores  were  identified  using  martingale  residual  plots  with 
respect  to  the  overall  survival  duration.  The  curves  are  labeled  with  the  corresponding  immunostaining  scores. 
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Table  2.  Estimation  of  overall  survival  durations  per  smoking  status  in  NSCLC  patients  from  whom  tumor  TMA  specimens 
were  obtained  using  univariate  Cox  proportional  hazards  regression  models 


Smoking  status 

Variable  estimate 

SE 

P 

Hazards  ratio 

Never 

CREB 

0.55 

0.24 

0.02 

1.73 

p-CREB 

0.99 

0.43 

0.02 

2.69 

Former 

CREB 

0.24 

0.28 

0.39 

1.28 

p-CREB 

0.75 

0.30 

0.01 

2.13 

Current 

CREB 

-0.46 

0.56 

0.41 

0.63 

p-CREB 

-0.03 

0.60 

0.96 

0.97 

p-CREB  in  the  TMA  specimens  are  shown  in  Fig.  3 B.  According  to 
the  Wilcoxon  rank  sum  test,  the  CREB  and  p-CREB  immunostain- 
ing  scores  were  significantly  higher  in  the  squamous  cell  carcinoma 
specimens  than  in  the  adenocarcinoma  and  BAC  specimens 
(Fig.  3 B,  CREB:  0.61  versus  0.43,  P  =  0.002;  p-CREB:  0.32  versus 
0.25,  P  =  0.008).  The  levels  of  immunohisto chemical  staining  of 
CREB  and  p-CREB  were  lower  in  the  TMA  specimens  than  in  the 
paraffin-embedded  whole  NSCLC  tissue  specimens.  This  pheno¬ 
menon  probably  resulted  from  larger  tissue  areas  and  stronger 
immunostaining  in  the  whole  tissue  specimens  than  in  the  TMA 
specimens.  We  detected  no  significant  differences  in  CREB  or 
p-CREB  expression  between  other  demographic  and  clinicopath- 
ologic  subpopulations  (Table  1). 

Effects  of  CREB  and  p-CREB  overexpression  on  overall 
survival  duration.  The  Kaplan-Meier  survival  curves  shown  in 
Fig.  4  show  that  overexpression  of  CREB  or  p-CREB  was  related  to  a 
lower  probability  of  survival.  Overexpression  of  CREB  [immunos¬ 
taining  score  >  0.9  (E  /  N  =  25/56)]  or  p-CREB  [immunostaining 
score  >  0.7  (E  /  N  =  23/45)]  was  significantly  associated  with 
decreased  overall  survival  duration  in  patients  with  NSCLC  (P  = 
0.02  and  0.002,  respectively,  log-rank  test).  We  also  used  univariate 
Cox  proportional  hazards  regression  models  to  determine  the 
effects  of  covariates  on  overall  survival  duration.  Factors  that 
significantly  affected  overall  survival  were  age,  histologic  subtype, 
p-CREB  expression,  pathologic  T  classification,  pathologic  N 
classification,  and  pathologic  tumor-node-metastasis  (TNM)  stage 
(Table  1).  Multivariate  Cox  proportional  hazards  regression  model 
analysis  showed  that  the  expression  of  both  CREB  and  p-CREB  was 
significantly  associated  with  decreased  overall  survival  after 
accounting  for  the  effects  of  age  and  pathologic  T  and  N 
classification  (CREB:  P  =  0.02;  p-CREB:  P  =  0.01).  One-unit 
increases  in  the  CREB  and  p-CREB  immunostaining  scores 
increased  the  risk  of  death  by  51%  and  80%,  respectively  (Table  1). 

Effects  of  CREB  and  p-CREB  overexpression  on  survival 
duration  according  to  smoking  status.  The  inverse  relationship 
between  the  level  of  CREB  and  p-CREB  expression  and  the  survival 
duration  was  significantly  dependent  on  smoking  status  in  the  310 
patients  from  whom  the  TMA  NSCLC  specimens  were  obtained. 
Overexpression  of  CREB  and  p-CREB  significantly  lowered  the 
probability  of  survival  in  never  smokers  [CREB  immunostaining 
score  >  0.5  (E  /  N  =  16/38);  p-CREB  immunostaining  score  >  0.5  (E  / 
N  =  10/17);  Fig.  4,  right]  but  not  in  former  or  current  smokers  (data 
not  shown).  In  addition,  univariate  Cox  proportional  hazards 
regression  model  analysis  showed  that  both  CREB  and  p-CREB 
immunostaining  scores  were  significantly  inversely  correlated  with 


overall  survival  duration  in  never  smokers,  increasing  the  risk  of 
death  by  73%  (P  =  0.02)  and  169%  (P  =  0.02),  respectively.  We  also 
observed  this  tendency  in  former  smokers,  as  both  CREB  and 
p-CREB  immunostaining  scores  were  inversely  correlated  with 
overall  survival  duration,  increasing  the  risk  of  death  by  28%  (P  = 
0.39)  and  113%  (P  =  0.01),  respectively.  In  contrast,  in  current 
smokers,  neither  CREB  nor  p-CREB  immunostaining  scores 
affected  overall  survival  duration  ( P  =  0.41  and  0.96,  respectively; 
Table  2). 

Discussion 

Our  study  of  the  role  of  CREB  in  the  development  and 
pathogenesis  of  NSCLC  showed  that  CREB  and  p-CREB  are 
overexpressed  in  patients  with  NSCLC  and  that  this  overexpression 
is  associated  with  a  negative  prognosis  in  never  smokers  with  this 
disease.  CREB  expression  levels  were  higher  and  CREB  was  more 
highly  activated  constitutively  in  NSCLC  cell  lines  than  in  NHTBE 
cells.  CREB  and  p-CREB  were  also  expressed  at  higher  levels  in 
frozen  NSCLC  tumor  specimens  than  in  paired  normal  tissue 
specimens.  This  overexpression  of  CREB  protein  resulted  primarily 
from  transcriptional  overexpression  of  CREB  mRNA  rather  than 
from  amplification  of  the  CREB  gene  copy  number.  These  results 
are  consistent  with  a  recent  report  showing  that  CREB  expression 
is  up-regulated  at  both  the  protein  and  mRNA  levels  in  primary 
acute  myeloid  leukemia  cells  compared  with  that  in  normal  blood 
cells  (9).  However,  the  report  indicated  that  CREB  overexpression 
was  associated  mainly  with  an  amplified  CREB  gene  copy  number 
(in  three  of  four  patients  with  acute  myeloid  leukemia),  whereas  we 
found  that  CREB  overexpression  occurred  mainly  at  the  gene 
transcription  level  (and  possibly  as  a  result  of  an  increased  CREB 
gene  copy  number  in  a  few  cases).  This  discrepancy  implies 
mechanistic  variation  in  CREB  overexpression  in  different  cancer 
types.  Immunohistochemical  analysis  of  slides  containing  whole 
sections  of  NSCLC  and  adjacent  normal  bronchial  or  bronchiolar 
epithelial  tissue  specimens  confirmed  that  expression  of  both 
CREB  and  p-CREB  was  significantly  higher  in  the  tumor  specimens 
than  in  the  normal  tissue  specimens.  In  addition,  the  results  of  our 
immunohistochemical  study  on  NSCLC  TMAs  showed  that  CREB 
and  p-CREB  immunostaining  scores  were  significantly  higher  in  the 
squamous  cell  carcinoma  specimens  than  in  the  adenocarcinoma 
specimens.  Taken  together,  these  results  clearly  indicate  over¬ 
expression  of  CREB  in  NSCLC. 

In  addition,  the  Kaplan-Meier  survival  curves  revealed  that 
overexpression  of  both  CREB  and  p-CREB  was  significantly 
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associated  with  decreased  overall  survival  durations  in  the  310 
patients  from  whom  the  TMA  NSCLC  specimens  were  obtained. 
This  observed  effect  of  CREB  and  p-CREB  overexpression  on 
survival  was  mainly  contributed  by  the  never-smoker  portion  of 
patients.  The  survival  durations  in  these  patients  were  strongly 
influenced  by  the  levels  of  CREB  and  p-CREB  expression,  whereas 
the  durations  in  former  and  current  smokers  were  less  or  not 
affected  by  them.  These  data  suggest  that  the  expression  level  of 
both  CREB  and  p-CREB  is  a  useful  biomarker  for  predicting 
survival  duration,  and  therefore,  CREB  could  be  a  therapeutic 
target  for  never  smokers  with  NSCLC.  Prognosis  for  and  treatment 
outcome  of  NSCLC  are  often  in  favor  of  never  smokers  (40,  41).  The 
reason  why  overexpression  of  CREB  and  p-CREB  is  a  negative 
prognostic  factor  in  never  smokers  but  not  in  former  or  current 
smokers  is  unclear.  One  possible  explanation  is  that  survival  of 
cancer  cells  depends  substantially  on  CREB  activity  in  never 
smokers,  less  so  in  former  smokers,  and  not  at  all  in  current 
smokers.  Alternatively,  former  and  current  smokers  may  have  other 
confounding  factors  that  predominate  over  CREB  activity  in 
affecting  overall  survival. 

Cancers  in  ever  smokers  may  use  multiple  (proto)oncogenic 
pathways  for  growth  and  survival.  Although  tobacco  smoking  is  the 
leading  cause  of  lung  cancer,  several  studies  have  shown  that  the 
biology  of  lung  cancer  differs  between  never  smokers  and  ever 
smokers  [see  Sun  and  colleagues  (46)  for  a  comprehensive  review; 
refs.  41-45].  For  examples,  some  well-characterized  mutations  of 
EGFR  are  more  frequently  detected  in  a  subset  of  lung  cancer 
patients  who  are  never  smokers,  whereas  mutations  in  KRas  are 
more  prevalent  in  smokers  and  such  mutation  statuses  are 
mutually  exclusive  in  each  given  set  of  lung  cancers  (42,  46).  CREB 
is  activated  by  two  downstream  pathways  of  EGFR  signaling, 
namely,  Ras-Raf-ERK-RSK  (26,  47)  and  PI3K-Akt  (48)  pathways,  but 
it  is  not  clear  whether  these  pathways  differentially  regulate  CREB 
expression  and  activity.  Further  studies  are  required  to  determine 
whether  CREB  plays  a  differential  role  in  the  pathogenesis  of  lung 
cancers  between  smokers  and  never  smokers. 

Of  note  is  that  CREB  may  also  play  a  significant  role  in  the 
progression  of  lung  cancer  at  its  early  stage.  Studies  showed  that 
the  transcriptional  activity  of  CREB  mediated  tobacco  smoke - 
stimulated  overexpression  of  amphiregulin  (49),  which  is  associ¬ 
ated  with  poor  prognosis  in  patients  with  NSCLC,  indicating  that 
CREB  may  play  an  important  role  in  the  early  stage  of  lung 
carcinogenesis  in  smokers  (50).  Recent  studies  examining  the 
role  of  CREB  in  lung  cancer  have  shown  an  elevated  expression  of 
p-CREB  in  lung  tumors  generated  in  insulin-like  growth  factor 
II-overexpressing  transgenic  mice  and  that  CREB  played  an 
important  role  in  survival  of  lung  cancer  cell  lines  (51,  52). 
Further  studies  are  warranted  to  molecularly  characterize  the 


differential  role  of  CREB  in  smokers  and  never  smokers  with 
NSCLC  tumors. 

CREB  overexpression  may  lead  to  up-regulation  of  genes  and 
activation  of  signaling  pathways  that  support  lung  tumor  growth 
and  survival.  To  test  this  hypothesis,  we  have  gathered  data 
indicating  that  inactivation  of  CREB  or  reduction  of  CREB 
expression  (via  the  expression  of  a  dominant  repressor  of  CREB 
or  small  interfering  RNA  against  CREB)  reduces  the  expression  of 
antiapoptotic  genes  and  consequently  inhibits  the  growth  and 
survival  of  NSCLC  cells  (53).  The  present  study  showed  that 
increased  expression  of  CREB  and  p-CREB  correlates  with 
decreased  overall  survival  durations  in  lung  cancer  patients, 
suggesting  that  sustained  overexpression  of  CREB  in  malignant 
cells  supports  the  growth  and  survival  of  tumor  cells. 

In  summary,  the  present  study  provided  important  initial 
insights  into  the  role  of  CREB  in  the  development  and  pathogenesis 
of  NSCLC.  First,  CREB  was  overexpressed  and  highly  active 
constitutively  in  NSCLC  cell  lines  and  banked  NSCLC  specimens. 
Second,  CREB  overexpression  seemed  to  be  more  attributable  to 
increased  CREB  mRNA  transcription  than  CREB  gene  amplifica¬ 
tion,  suggesting  that  CREB  overexpression  is  a  correctable 
therapeutic  target  for  patients  with  NSCLC.  Third,  overexpression 
of  both  CREB  and  p-CREB  was  independently  correlated  with 
significantly  decreased  overall  survival  durations  in  never  smokers 
but  not  in  current  or  former  smokers  with  NSCLC.  However, 
further  extensive  studies  are  required  to  determine  whether  CREB 
plays  a  differential  role  in  the  development  of  lung  cancer  in  never 
and  ever  smokers.  To  the  best  of  our  knowledge,  this  study  is  the 
first  to  provide  evidence  that  CREB  overexpression  and  p-CREB 
overexpression  are  negative  prognostic  factors  in  never  smokers 
with  NSCLC.  Therefore,  targeting  CREB,  such  as  with  the  use  of 
CREB  inhibitors,  may  be  a  preventive  strategy  for  individuals  at 
high  risk  for  NSCLC  and/or  a  targeted  therapeutic  strategy  for  this 
disease,  especially  among  never  smokers. 
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Abstract 

Detection  of  cancer  cells  at  early  stages  could  poten¬ 
tially  increase  survival  rates  in  cancer  patients.  Aber¬ 
rant  promoter  hypermethylation  is  a  major  mechanism 
for  silencing  tumor  suppressor  genes  in  many  kinds  of 
human  cancers.  A  recent  report  from  our  laboratory 
described  the  use  of  quantitative  methylation-specific 
PCR  assays  for  discriminating  patients  with  lung 
cancer  from  those  without  lung  cancer  using  lung 
biopsies  as  well  as  sputum  samples.  TCF21  is  known  to 
be  essential  for  differentiation  of  epithelial  cells 
adjacent  to  mesenchyme.  Using  restriction  landmark 
genomic  scanning,  a  recent  study  identified  TCF21  as 
candidate  tumor  suppressor  at  6q23-q24  that  is  epige- 
netically  inactivated  in  lung  and  head  and  neck  can¬ 
cers.  Using  DNA  sequencing  technique,  we  narrowed 


down  a  short  CpG-rich  segment  (eight  specific  CpG 
sites  in  the  CpG  island  within  exon  1)  of  the  TCF21 
gene,  which  was  unmethylated  in  normal  lung  epithe¬ 
lial  cells  but  predominantly  methylated  in  lung  cancer 
cell  lines.  We  specifically  targeted  this  short  CpG-rich 
sequence  and  developed  a  quantitative  methylation- 
specific  PCR  assay  suitable  for  high-throughput  anal¬ 
ysis.  We  showed  the  usefulness  of  this  assay  in 
discriminating  patients  with  lung  cancer  from  those 
without  lung  cancer  using  biopsies  and  sputum  sam¬ 
ples.  We  further  showed  similar  applications  with 
multiple  other  malignancies.  Our  assay  might  have 
important  implications  in  early  detection  and  surveil¬ 
lance  of  multiple  malignancies.  (Cancer  Epidemiol 
Biomarkers  Prev  2008;17(4):995-1000) 


Introduction 

Transcriptional  inactivation  of  CpG  island-containing 
promoters  of  tumor  suppressor  genes  by  DNA  hyper¬ 
methylation  has  been  well  documented  in  many  human 
cancers  (1).  Methylation  of  specific  CpG  residues  within 
a  CpG  island  of  a  tumor  suppressor  gene  may  reflect 
gene  silencing  and  indicate,  at  least  in  part,  the  expres¬ 
sion  status  of  the  gene.  Gene  promoter  hypermethylation 
potentially  provides  a  noninvasive  screen  for  early 
cancer  detection  (2). 

Methylation  analyses  have  been  conducted  using 
conventional  methodologies  such  as  COBRA,  direct 
sequencing,  or  methylation-specific  PCR  of  the  bisul- 
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fite-treated  DNA.  However,  these  methods  are  labor 
intensive,  amenable  to  false-positive  results,  and  not 
suitable  for  high-throughput  analysis.  Methyl  Light 
assays  are  not  only  highly  specific,  sensitive,  and  repro¬ 
ducible  but  also  are  nonsubjective  and  allow  for  rapid 
analysis  of  many  samples  at  multiple  gene  loci  (3,  4). 
Recent  publications  have  shown  the  presence  of  pro¬ 
moter  hypermethylation  of  various  genes  in  clinical 
specimens  containing  exfoliated  tumor  cells  (such  as 
malignant  effusions,  sputum,  serum,  etc.;  refs.  5-8). 
Recently,  we  reported  quantitative  methylation-specific 
PCR  analysis  of  sputum  DNA  based  on  a  panel  of 
methylated  genes  (9).  The  panel  separated  patients  with 
lung  cancer  from  those  without  lung  cancer,  showing 
the  potential  of  the  quantitative  methylation-specific 
PCR  analysis  of  sputum  as  an  effective  biomarker  assay. 
Subsequently,  we  decided  to  explore  additional  novel 
markers  that  might  further  improve  this  assay. 

TCF21  is  known  to  be  essential  for  differentiation  of 
epithelial  cells  adjacent  to  mesenchyme  (10).  Using  res¬ 
triction  landmark  genomic  scanning.  Smith  et  al.  (11) 
identified  TCF21  as  a  candidate  tumor  suppressor  at 
6q23-q24  that  is  epigenetically  inactivated  in  lung  and 
head  and  neck  cancers.  In  our  article,  using  DNA 
sequencing,  we  examined  lung  cancer  and  bronchial 
epithelial  cell  lines  for  methylation  of  the  CpG  island 
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within  exon  1  of  the  TCF21  gene  (GenBank  accession  no. 
AF047419).  We  identified  a  short  CpG-rich  segment  that 
was  unmethylated  in  normal  bronchial  cells  but  pre¬ 
dominantly  methylated  in  lung  cancer  cell  lines.  We 
decided  to  develop  a  high-throughput  quantitative  bio¬ 
assay  to  determine  prevalence  of  methylation  in  clinical 
samples  and  evaluate  its  potential  as  cancer  biomarker 
assay  applicable  for  multiple  cancer  types. 

Materials  and  Methods 

Surgically  resected  non-small  lung  and  bladder  cancers 
and  their  adjacent  nonmalignant  tissues  were  obtained 
from  the  University  of  Texas  M.  D.  Anderson  Cancer 
Center.  Surgically  resected  breast  cancers  and  their 
adjacent  nonmalignant  breast  tissues  were  obtained  from 
Parkland  Memorial  Hospital.  Leukemia  cases,  all  acute 
myeloid  leukemia,  were  obtained  from  Parkland  Memo¬ 
rial  Hospital.  Peripheral  blood  mononuclear  cells  were 
obtained  from  healthy  individuals  with  a  family  history 
of  cancer.  We  established  all  cell  lines  used  in  this  study. 
Sputum  samples  were  obtained  from  13  patients  with 
non -small  cell  lung  cancer  (NSCLC)  and  25  individuals 
with  chronic  obstructive  pulmonary  disease;  all  were 
heavy  smokers  without  lung  cancer  in  the  Canisius 
Wilhelmina  Hospital.  Three-day  pooled  early  morning 
sputum  samples  were  collected  in  Saccomanno's  fixative 
(2%  polyethylene  glycol  in  50%  ethanol).  Informed 
consent  and  institutional  review  board  permission  were 
obtained  at  each  site. 

Gene  Expression  in  Cell  Lines.  Gene  expression 
studies  were  conducted  as  described  previously  (11) 
with  some  modifications.  Semiquantitative  real-time  PCR 
was  carried  out  by  using  QuantiTect  SYBR  Green  PCR 
kit.  The  expression  levels  were  quantitated  using  com¬ 
parative  Ct  method.  In  case  of  both  TCF21  and  WNT4, 
means  of  expression  values  for  the  two  human  bronchial 
epithelial  cells  (HBEC)  were  considered  to  have  a  value 
of  1.  5-Aza-2-deoxcytidine  treatment  of  lung  cell  lines 
(H1299,  H2887,  HCC95,  and  H661)  was  done  using 
protocol  as  described  previously  (12). 

DNA  Extraction  and  Bisulfite  Modification  and 
DNA  Sequencing.  Genomic  DNA  was  extracted  from 
cell  lines,  primary  tumors  and  nonmalignant  cells, 
sputum  samples  as  described  previously  (9).  Sodium 
bisulfite  treatment  was  done  as  described  previously 
(13).  The  modified  DNA  was  used  as  a  template  for 
quantitative  PCR  (qPCR)  analysis.  The  DNA  sequencing 
was  carried  out  as  described  previously  using  Applied 
Biosystems  prism  dye  terminator  cycle  sequencing 
method  (Perkin-Elmer;  ref.  14).  Primers  were  designed 
to  exclude  CG  sites,  rendering  DNA  amplification 
independent  of  the  methylation  status:  forward  5- 
ATGTGGAGGATTTTTAAGAGGT-3'  and  reverse  5- 
CTAAAAAAAACCTTACTCAACACTC-3'.  The  sequen¬ 
ces  were  confirmed  by  sequencing  in  both  directions. 

Quantitative  Real-time  PCR  Analysis.  qPCR  analysis 
was  done  using  the  Chromo4  MJ  Research  Real-time  PCR 
System.  Sodium  bisulfite -treated  genomic  DNA  was 
amplified  by  fluorescence-based  real-time  methylation- 
specific  PCR  using  TaqMan  technology  as  described 
previously  (9,  15,  16).  In  brief,  primers  (forward  5- 
CGAGGAGAGTTTTAATTGCGAGA-3'  and  reverse  5- 


CCTAACTAACCCCGCTCAAAAAA-3')  and  probe  (5- 
FAM-TAGAAGGGTCGCGGCGGTTTGGBHQ-1-3') 
were  designed  to  specifically  amplify  bisulfite-converted 
DNA  within  the  region  of  the  test  genes  that  was 
differentially  methylated  between  expression-positive 
and  expression-negative  cell  lines  (Fig.  1).  The  non- 
methylated  form  of  MYOD1  was  used  as  an  internal 
reference  standard  (9).  The  sputum  DNA  samples  were 
coded  and  shipped  from  The  Netherlands  and  analyzed 
in  Dallas  in  a  blinded  fashion.  Some  of  the  lung  cancer 
DNA  was  also  independently  analyzed  by  three  inves¬ 
tigators  (N.S.,  V.S.,  and  C.P.)  in  a  blinded  fashion.  The 
data  from  all  the  three  investigators  were  in  close 
agreement  (data  not  shown). 

Nested  qPCR  protocol  was  accomplished  by  doing 
two  rounds  of  PCR  as  reported  previously  (17).  In  the 
first  round,  external  primers  used  for  DNA  sequencing 
were  used.  The  successful  amplification  in  the  first  round 
was  confirmed  through  agarose  gel  electrophoresis.  The 
second  round  was  carried  on  the  500  x  dilution  of  the 
first  round  reaction  using  the  probe  and  primer  sets  as  in 
quantitative  methylation-specific  PCR  protocol  described 
above. 

Statistical  Analysis.  The  receiver  operating  character¬ 
istic  curves,  a  plot  of  the  sensitivity  versus  specificity 
across  all  possible  cutoff  values,  were  used  to  identify 
the  accuracy  of  a  marker  in  discriminating  cancer  from 
nonmalignant  tissue.  The  quantitative  methylation  data 
for  the  gene  were  correlated  with  tumor  stage  using  the 
Mann-Whitney  U  test,  which  does  not  require  paramet¬ 
ric  assumption  on  the  distribution  of  quantitative 
methylation.  Statistical  differences  between  groups  were 
examined  using  Fisher's  exact  test.  P  values  <0.05  were 
considered  significant. 

Results 

We  sequenced  the  CpG  island  within  exon  1  of  the 
TCF21  gene  from  10  lung  cancer  cell  lines  and  2  HBECs 
(18)  and  also  analyzed  them  for  gene  expression.  The 
results  showed  that  all  the  19  CpG  sites  were  methylated 
in  9  of  10  lung  cancer  cell  lines,  whereas  8  specific  CpG 
sites  of  the  19  were  unmethylated  only  in  H661  (a  cancer 
cell  line)  and  in  the  HBECs  (Fig.  1).  All  methylation- 
positive  cancer  cell  lines  were  negative  for  TCF21 
expression  (relative  expression  <0.50).  H661  and  HBECs 
were  negative  for  methylation  and  positive  for  TCF21 
expression  (relative  expression  >1;  Fig.  1).  5-Aza-2'- 
deoxycytidine  treatment  to  four  lung  cell  lines  (HI 299, 
H2887,  HCC95,  and  H661)  resulted  into  reactivation  of 
TCF21  expression  in  all  the  cell  lines,  except  H661,  which 
was  positive  for  TCF21  expression  in  untreated  popula¬ 
tion  (data  not  shown).  The  results  suggest  that  methyl¬ 
ation  of  the  specific  sites  is  related  to  gene  silencing. 
However,  it  is  possible  that  other  factors  in  addition  to 
methylation  (19)  might  also  contribute  to  gene  silencing 
in  these  lung  cancer  cell  lines.  We  examined  WNT4 
expression  for  two  reasons.  Our  preliminary  studies 
showed  consistent  loss  of  its  expression  in  lung  cancer 
cell  lines  compared  with  HBECs.  Secondly,  the  gene  was 


8  Unpublished  data. 
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0  Methylated  CpG  site  A=  TCF21  methylation 

O  UnMethylated  CpG  site  B=  TCF21  expression 
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Figure  1.  Direct  sequencing  of  the  bisulfite  PCR  product  from  the  CpG  island  within  exon  1  of  the  TCF21  gene  for  10  lung  cancer 
cell  lines  and  2  normal  lung  epithelial  cell  lines  (HBEC2  and  HBEC3).  Closed  circles,  methylated  CpG  sites;  open  circles, 
unmethylated  CpG  sites.  The  expression  levels  were  quantitated  using  comparative  Ct  method.  In  both  TCF21  and  WNT4  expression 
analysis,  means  of  expression  values  for  the  two  HBECs  were  considered  to  have  a  value  of  1.  (expression  positive,  relative 
expression  >1;  expression  negative,  relative  expression  <0.50).  Sequencing  and  expression  analyses  were  conducted  as  described  in 
Materials  and  Methods.  Arrows,  locations  of  primers;  open  box,  location  of  the  probe. 


also  shown  to  be  induced  significantly  when  the  lung 
cancer  cells  were  transfected  with  TCF21  (11).  Interest¬ 
ingly,  all  the  lung  cancer  cell  lines  (except  H661)  were 
negative  for  WNT4  expression  (relative  expression  <0.50; 
Fig.  1).  Based  on  all  above  observations,  the  specific  CpG 
region  was  considered  a  potential  target  to  develop  an 
assay  to  discriminate  between  cancer  cells  and  normal 
cells. 

We  tested  the  qPCR  assay  for  methylation  of  TCF21  by 
analysis  of  40  NSCLC  tumors  (22  adenocarcinoma  and 
18  squamous  carcinomas)  and  their  adjacent  nonmalig- 
nant  lung  tissues.  Additionally,  peripheral  blood  mono¬ 
nuclear  cells  from  12  healthy  subjects  recruited  for 
genetic  epidemiology  studies  were  also  quantitatively 
analyzed.  Figure  2A  shows  the  quantitative  methylation 
data  for  NSCLC  and  their  adjacent  nonmalignant  lung 
tissue.  We  found  aberrant  methylation  of  TCF21  in  30  of 
40  (75%)  of  primary  lung  tumors  (QR,  0-129.67)  and  in 
7  of  40  (18%)  adjacent  lung  tissue  (QR,  0-0.8).  Based  on 
the  highest  QR  in  nonmalignant  tissue  as  the  cutoff,  28 
of  40  (70%)  of  primary  NSCLC  were  found  to  have 
TCF21  methylated.  Additionally,  peripheral  blood  mono¬ 
nuclear  cells  from  18  healthy  subjects  recruited  for 
genetic  epidemiologic  studies  were  analyzed  for  TCF21 
methylation  and  found  to  be  below  the  levels  of  detec¬ 
tion.  Figure  2B  shows  the  receiver  operating  character¬ 
istic  curve,  which  provides  evidence  for  the  excellent 
discriminatory  capacity  of  TCF21  methylation  in  sepa¬ 
rating  cancer  from  adjacent  normal  lung  tissue.  We  also 
correlated  quantitative  methylation  data  for  TCF21  with 
tumor  stage  using  the  Mann-Whitney  P  test.  Increased 


methylation  levels  (QR)  were  observed  with  increase  in 
stage  (stage  I-II,  P  =  0.008;  stage  I  combined  with  stages  II 
and  III,  P  =  0.002). 

We  further  tested  the  qPCR  assay  for  methylation  of 
TCF21  in  sputum  DNA  from  patients  with  lung  cancer 
(Fig.  2C).  We  analyzed  38  sputum  DNA  samples  (13  from 
cancer  and  25  noncancer  patients)  for  methylation  of 
TCF21 .  We  found  7  of  13  (54%)  cancer  sputa  (QR,  0-34.63) 
and  0  of  25  (0%)  noncancer  sputa  showed  methylation  of 
TCF21  (P  <  0.0001;  Fig.  2C).  Combination  of  nested  PCR 
and  qPCR  led  to  increase  in  methylation  frequencies 
from  7  of  13  (54%)  to  9  of  13  (70%)  in  cancer  cases  and 
from  0  of  25  (0%)  to  3  of  25  (12%)  in  chronic  obstructive 
pulmonary  diseases  (table  in  Fig.  2C).  Whereas  no 
methylation  was  detected  (Ct  values  of  50)  in  normal 
lung  and  normal  lymphocytes,  the  methylated  status  of 
the  newly  observed  five  positive  samples  (all  with  Ct 
values  <35)  following  two-round  PCR  protocol  was 
confirmed  through  DNA  sequencing  (data  not  shown). 
We  speculate  that  these  five  cases  might  be  at  higher  risk 
to  develop  lung  cancer  than  other  cases,  all  of  which  (like 
normal  lung  and  normal  lymphocytes)  showed  Ct  values 
of  50  even  after  two  rounds  of  PCR.  We  have  proven  the 
feasibility  of  the  nested  qPCR  combination  approach  for 
increasing  the  sensitivity  without  significantly  compro¬ 
mising  the  specificity.  Further  development  of  the  nested 
qPCR  assay  is  in  progress  to  assess  real  effect  of  this 
approach. 

We  further  tested  the  qPCR  assay  for  methylation  of 
TCF21  by  analysis  of  30  breast  tumors  and  their  adjacent 
nonmalignant  tissues.  We  found  aberrant  methylation  of 
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Figure  2.  A.  Methylation  levels  of 
mTCF21  in  NSCLC  ( Tumor ),  adjacent 
nonmalignant  lung,  and  peripheral 
blood  mononuclear  cells  from  cancer- 
free  individuals.  Methylation  levels 
were  quantitated  by  semiquantitative 
real-time  PCR.  Real-time  analysis  was 
done  as  described  in  Materials  and 
Methods.  Quantitative  ratio  is  defined 
as  the  ratio  of  the  fluorescence  emis¬ 
sion  intensity  values  for  the  PCR 
products  of  the  biomarker  gene  to  those 
of  PCR  products  of  MYOD1  multiplied 
by  100.  The  ratio  is  a  measure  for  the 
relative  level  of  methylation  in  an 
individual  sample.  Because  values  are 
expressed  on  a  log  scale,  completely 
negative  values  are  expressed  as  values 
of  0.01.  Solid  horizontal  bar,  threshold 
above  which  the  samples  are  consid¬ 
ered  positive  for  methylation.  B.  Re¬ 
ceiver  operating  characteristic  curves 
for  mTCF21  at  separating  cancer  from 
the  adjacent  nonmalignant  lung.  Re¬ 
ceiver  operating  characteristic  curves 
are  plots  of  the  true-positive  rate 
(7  axis)  against  the  false-positive  rate 
( X axis)  for  the  different  possible  cutoff 
points  of  a  diagnostic  test.  The  closer 
the  curve  follows  the  left-hand  border 
and  then  the  top  border  of  the  receiver 
operating  characteristic  space,  the  more 
accurate  the  test  [area  under  the  curve 
(AUC)  =  0.85].  C.  Methylation  levels 
of  m TCF21  in  sputum  from  NSCLC 
patients  and  in  sputa  from  patients 
without  malignancy  (chronic  obstruc¬ 
tive  pulmonary  diseases).  Methylation 
levels  were  quantitated  as  described  in 
Materials  and  Methods.  The  table 
shows  the  frequency  of  methylation  in 
the  sputum  samples  following  the 
combination  of  nested  PCR  and  qPCR. 
The  two-round  PCR  protocol  was 
carried  out  as  described  in  Materials 
and  Methods. 
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Table  1. 

Quantitative  data  for  methylation  of  TCF21  in  multiple  malignancies 

Tissue 

Quantitative  ratio  (range) 

%  Samples  positive  for  methylation  in  tumors  with 
QR  above  the  highest  in  normal  group 

Cancer 

Normal 

Breast 

Bladder 

Colon 

Leukemia 

0-850.68  ( n  =  30) 

0-52.60  (n  =  41) 
0-256.83  ( n  =  20) 

0-74.80  (n  =  27) 

0-2.0  (n  =  30) 

0-0  ( n  =  6) 

0-8.12  (n  =  20) 

0-0  ( n  =  12) 

20/30  (67) 

18/41  (44) 

15/20  (75) 

11/27  (41) 

TCF21  in  26  of  30  (87%)  of  breast  tumors  (QR,  0-850.68) 
and  in  13  of  30  (40%)  adjacent  breast  tissue  (QR,  0-2; 
Table  1).  Based  on  the  highest  QR  in  nonmalignant  tissue 
as  the  cutoff,  20  of  30  (67%)  of  primary  breast  tumors 
were  found  to  have  TCF21  methylated.  The  data  show 
that  TCF21  methylation  has  excellent  discriminatory 
capacity  at  separating  cancer  from  adjacent  nonmalig¬ 
nant  breast  tissue  (Table  1). 

Additionally,  we  tested  the  qPCR  assay  for  methyla¬ 
tion  of  TCF21  in  bladder  cancers,  colon  cancers,  and 
leukemias  with  corresponding  nonmalignant  tissues 
(Table  1).  Also,  in  the  case  of  bladder  cancer  cases,  a 
strong  trend  was  observed  between  methylation  and 
invasiveness  (52%  in  invasive  and  27%  in  noninvasive 
cases).  In  all  the  malignancies,  TCF21  methylation 
appears  to  have  excellent  discriminatory  capacity  at 
separating  cancer  from  nonmalignant  tissue. 


Discussion 

In  this  article,  we  identified  a  short  CpG-rich  sequence 
within  exon  1  of  the  TCF21  gene  that  was  differentially 
methylated  in  lung  cancer  cells  compared  with  normal 
lung  cells.  We  targeted  this  region  to  design  a  methyl- 
ation-specific  qPCR  assay  suitable  for  high-throughput 
analysis.  We  successfully  showed  the  application  of  this 
assay,  in  discriminating  patients  with  lung  cancer  from 
those  without  lung  cancer,  using  lung  cancer  biopsies 
and  sputa  from  cancer  patients.  We  further  showed 
similar  application  of  this  assay  with  multiple  other 
malignancies,  covering  different  organ  systems. 

Transition  between  epithelial  cell  to  mesenchymal  cell 
is  known  to  occur  during  tumorigenesis  (20,  21). 
Epithelial-mesenchymal  transition  has  been  described 
in  many  cancers  and  correlates  with  clinical  outcome 
(20).  Malignant  lesions  are  often  defined  by  their 
differentiation  status,  where  benign  tumors  and  low- 
grade  cancers  typically  retain  their  epithelial  phenotype 
and  malignant  cells  acquire  more  fibroblastic  mesenchy¬ 
mal  phenotype  (20).  As  mentioned  in  the  results.  Smith 
et  al.  (11)  previously  reported  highly  significant  induc¬ 
tion  of  WNT4  expression  in  lung  cancer  cells  ( TCF21 
expression  negative)  when  transfected  with  TCF21. 
WNT4  has  been  associated  previously  with  epithelial- 
mesenchymal  transition  and  epithelial  phenotype  (22). 
We  observed  significant  expression  of  WNT4  in  HBECs 
with  down-regulation  in  all  the  lung  cancer  cells 
analyzed  in  our  study,  except  H661  ( TCF21  expression 
positive).  Our  results  suggest  the  interesting  possibility 
that  WNT4  might  be  one  of  the  proximate  downstream 
targets  of  TCF21.  This  possibility  should  be  further  ex¬ 
plored.  Thus,  our  bioassay  to  quantitate  the  prevalence 
of  methylation  of  TCF21  in  clinical  samples  may  help 


determine  the  risk  of  cancer  progression  in  patients  and 
have  considerable  clinical  application. 

It  appears  from  our  data  that  methylation  of  the  short 
CpG-rich  sequence  in  the  TCF21  gene  may  have 
significant  potential  as  a  biomarker  assay  in  lung  cancer 
and  multiple  other  cancers.  As  evident  from  sputum 
analysis,  the  feasibility  of  increasing  the  sensitivity  with¬ 
out  significantly  compromising  the  specificity  (through 
combination  of  nested  and  qPCR)  might  make  the  assay 
more  appealing  for  clinical  as  well  as  epidemiologic 
applications.  Studies  are  in  progress  to  further  evaluate 
the  potential  of  the  assay  in  noninvasive  detection  of 
lung  and  other  cancers. 


References 

1.  Jones  PA,  Laird  PW.  Cancer  epigenetics  comes  of  age.  Nat  Genet 
1999;21:163-7. 

2.  Sidransky  D.  Emerging  molecular  markers  of  cancer.  Nat  Rev  Cancer 
2002;2:210-219. 

3.  Eads  CA,  Nickel  AE,  Laird  PW.  Complete  genetic  suppression 
of  polyp  formation  and  reduction  of  CpG-island  hypermethyla- 
tion  in  Apc(Min/+)  Dnmtl-hypomorphic  mice.  Cancer  Res  2002;62: 
1296-9. 

4.  Eads  CA,  Lord  RV,  Wickramasinghe  K,  et  al.  Epigenetic  patterns  in 
the  progression  of  esophageal  adenocarcinoma.  Cancer  Res  2001;61: 
3410-8. 

5.  Brabender  J,  Usadel  H,  Danenberg  KD,  et  al.  Adenomatous  polyposis 
coli  gene  promoter  hypermethylation  in  non-small  cell  lung  cancer  is 
associated  with  survival.  Oncogene  2001;20:3528-32. 

6.  Usadel  H,  Brabender  J,  Danenberg  KD,  et  al.  Quantitative  adenoma¬ 
tous  polyposis  coli  promoter  methylation  analysis  in  tumor  tissue, 
serum,  and  plasma  DNA  of  patients  with  lung  cancer.  Cancer  Res 
2002;62:371-5. 

7.  Harden  SV,  Tokumaru  Y,  Westra  WH,  et  al.  Gene  promoter 
hypermethylation  in  tumors  and  lymph  nodes  of  stage  I  lung  cancer 
patients.  Clin  Cancer  Res  2003;9:1370-5. 

8.  Topaloglu  O,  Hoque  MO,  Tokumaru  Y,  et  al.  Detection  of  promoter 
hypermethylation  of  multiple  genes  in  the  tumor  and  bronchoalveo- 
lar  lavage  of  patients  with  lung  cancer.  Clin  Cancer  Res  2004;10: 
2284-8. 

9.  Shivapurkar  N,  Stastny  V,  Suzuki  M,  et  al.  Application  of  a 
methylation  gene  panel  by  quantitative  PCR  for  lung  cancers.  Cancer 
Lett  2007;247:56-71. 

10.  Quaggin  SE,  Schwartz  L,  Cui  S,  et  al.  The  basic-helix-loop-helix 
protein  podl  is  critically  important  for  kidney  and  lung  organogen¬ 
esis.  Development  1999;126:5771-83. 

11.  Smith  LT,  Lin  M,  Brena  RM,  et  al.  Epigenetic  regulation  of  the  tumor 
suppressor  gene  TCF21  on  6q23-24  in  lung  and  head  and  neck 
cancer.  Proc  Natl  Acad  Sci  U  S  A  2006;103:982-7. 

12.  Shigematsu  H,  Suzuki  M,  Takahashi  T,  et  al.  Aberrant  methylation  of 
HIN-1  (high  in  normal-1)  is  a  frequent  event  in  many  human 
malignancies.  Int  J  Cancer  2005;113:600-4. 

13.  Herman  JG,  Graff  JR,  Myohanen  S,  et  al.  Methylation-specific  PCR:  a 
novel  PCR  assay  for  methylation  status  of  CpG  islands.  Proc  Natl 
Acad  Sci  U  S  A  1996;93:9821-6. 

14.  Shigematsu  H,  Takahashi  T,  Nomura  M,  et  al.  Somatic  mutations  of 
the  HER2  kinase  domain  in  lung  adenocarcinomas.  Cancer  Res  2005; 
65:1642-6. 

15.  Toyooka  KO,  Toyooka  S,  Maitra  A,  et  al.  Establishment  and 
validation  of  real-time  polymerase  chain  reaction  method  for 
CDH1  promoter  methylation.  Am  J  Pathol  2002;161:629-34. 


Cancer  Epidemiol  Biomarkers  Prev  2008;17(4).  April  2008 


1000  A  Promising  Cancer  Biomarker  Assay 


16.  Shivapurkar  N,  Stastny  V,  Takahashi  T,  et  al.  Novel  real-time  PCR 
assay  using  a  universal  molecular  marker  for  diagnosis  of  hemato¬ 
logic  cancers.  Int  J  Cancer  2005;116:656-60. 

17.  Fackler  MJ,  McVeigh  M,  Mehrotra  J,  et  al.  Quantitative  multiplex 
methylation-specific  PCR  assay  for  the  detection  of  promoter 
hypermethylation  in  multiple  genes  in  breast  cancer.  Cancer  Res 
2004;64:4442-52. 

18.  Ramirez  RD,  Sheridan  S,  Girard  L,  et  al.  Immortalization  of  human 
bronchial  epithelial  cells  in  the  absence  of  viral  oncoproteins.  Cancer 
Res  2004;64:9027-34. 


19.  Fuks  F.  DNA  methylation  and  histone  modifications:  teaming  up  to 
silence  genes.  Curr  Opin  Genet  Dev  2005;15:490-5. 

20.  Hugo  H,  Ackland  ML,  Blick  T,  et  al.  Epithelial-mesenchymal  and 
mesenchymal-epithelial  transitions  in  carcinoma  progression.  J  Cell 
Physiol  2007;213:374-83. 

21.  Guarino  M,  Rubino  B,  Ballabio  G.  The  role  of  epithelial-mesenchymal 
transition  in  cancer  pathology.  Pathology  2007;39:305-18. 

22.  Taki  M,  Kamata  N,  Yokoyama  K,  et  al.  Down-regulation  of  Wnt-4  and 
up-regulation  of  Wnt-5a  expression  by  epithelial-mesenchymal  transi¬ 
tion  in  human  squamous  carcinoma  cells.  Cancer  Sci  2003;94:593  -  7. 


Cancer  Epidemiol  Biomarkers  Prev  2008;17(4).  April  2008 


Cancer  Prevention  Research 


Cyclic  AMP-Responsive  Element  Binding  Protein-  and  Nuclear 
Factor-KB-Regulated  CXC  Chemokine  Gene  Expression 
in  Lung  Carcinogenesis 


Hongxia  Sun,  Wen-Cheng  Chung,  Seung-Hee  Ryu,  Zhenlin  Ju,  Hai  T.  Tran,  Edward  Kim, 
Jonathan  M.  Kurie  and  Ja  Seok  Koo 


Abstract  The  recognition  of  the  importance  of  angiogenesis  in  tumor  progression  has  led  to  the 
development  of  antiangiogenesis  as  a  new  strategy  for  cancer  treatment  and  prevention. 
By  modulating  tumor  microenvironment  and  inducing  angiogenesis,  the  proinflammatory 
cytokine  interleukine  (IL)-1  (3  has  been  reported  to  promote  tumor  development.  However, 
the  factors  mediating  IL-1(3-induced  angiogenesis  in  non-small  cell  lung  cancer  (NSCLC) 
and  the  regulation  of  these  angiogenic  factors  by  IL-1  (3  are  less  clear.  Here,  we  report  that 
IL-1  p  up-regulated  an  array  of  proangiogenic  CXC  chemokine  genes  in  the  NSCLC  cell  line 
A549  and  in  normal  human  tracheobronchial  epithelium  cells,  as  determined  by  microarray 
analysis.  Further  analysis  revealed  that  IL-1  p  induced  much  higher  protein  levels  of  CXC 
chemokines  in  NSCLC  cells  than  in  normal  human  tracheobronchial  epithelium  cells.  Con¬ 
ditioned  medium  from  IL-lp-treated  A549  cells  markedly  increased  endothelial  cell  migra¬ 
tion,  which  was  suppressed  by  neutralizing  antibodies  against  CXCL5  and  CXCR2.  We  also 
found  that  IL-lp-induced  CXC  chemokine  gene  overexpression  in  NSCLC  cells  was  abro¬ 
gated  with  the  knockdown  of  cyclic  AMP-responsive  element  binding  protein  (CREB)  or 
nuclear  factor  kB  (NF-kB).  Moreover,  the  expression  of  the  CXC  chemokine  genes  as  well 
as  CREB  and  NF-kB  activities  was  greatly  increased  in  the  tumorigenic  NSCLC  cell  line 
compared  with  normal,  premalignant  immortalized  or  nontumorigenic  cell  lines.  A  disruptor 
of  the  interaction  between  CREB-binding  protein  and  transcription  factors  such  as  CREB 
and  NF-kB,  2-naphthol-AS-E-phosphate  (KG-501),  inhibited  IL-1  p-induced  CXC  chemokine 
gene  expression  and  angiogenic  activity  in  NSCLC.  We  propose  that  targeting  CREB  or 
NF-kB  using  small-molecule  inhibitors,  such  as  KG-501,  holds  promise  as  a  preventive 
and/or  therapeutic  approach  for  NSCLC. 


Lung  cancer  is  the  leading  cause  of  cancer  deaths  both  in  the 
United  States  and  worldwide  (1).  The  growth  and  develop¬ 
ment  of  lung  cancer  as  well  as  of  other  solid  tumors  is  criti¬ 
cally  dependent  on  a  functional  vascular  supply.  Numerous 
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lines  of  evidence  have  shown  that  angiogenesis,  the  formation 
of  new  blood  vessels  from  the  preexisting  vasculature,  is  one 
of  the  critical  steps  in  the  entire  process  of  cancer  development 
from  tumor  growth  to  distant  metastasis  (2-5).  In  addition,  a 
study  showed  that  a  solid  tumor  would  remain  dormant  at  a 
volume  of  only  2  to  3  mm3  in  the  absence  of  neovasculariza¬ 
tion  and  would  be  unable  to  metastasize  (6).  For  solid  tumors 
to  develop  and  metastasize,  they  need  to  secrete  a  number  of 
proangiogenic  factors  to  induce  the  formation  of  new  blood 
vessels  to  overcome  the  physical  limitations  on  the  diffusion 
of  nutrients  and  oxygen  within  the  tumor — a  process  known 
as  angiogenic  switch  (3).  Recognizing  the  importance  of  angio¬ 
genesis  in  the  growth  of  tumors  has  led  to  the  development  of 
antiangiogenesis  as  a  new  strategy  for  cancer  therapy  and 
prevention,  a  novel  concept  termed  //angioprevention,/  (7-9). 
Actually,  a  series  of  molecules  proposed  as  chemopreventive 
agents  have  been  shown  to  have  potent  antiangiogenic 
properties  when  tested  in  in  vitro  and  in  vivo  angiogenesis 
models  (7). 

Angiogenesis  can  be  regulated  by  various  growth  factors 
and  cytokines,  including  vascular  endothelial  growth  factor 
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(VEGF),  basic  fibroblast  growth  factor,  transforming  growth 
factors  a  and  p,  platelet-derived  endothelial  cell  growth 
factors,  chemokines,  and  interleukine  (IL)-lp  (10-14).  Recent 
studies  have  shown  the  importance  of  the  tumor  microenvi¬ 
ronment  in  facilitating  angiogenesis  and  promoting  tumor 
invasion  and  metastasis  (15-19).  Once  a  tumor  is  vascularized, 
the  tumor-associated  antigens  can  be  recognized  by  the  im¬ 
mune  system  and  the  tumor  is  infiltrated  by  leukocytes. 
Although  leukocyte  infiltration  in  tumors  is  often  considered 
to  be  associated  with  better  prognosis  and  overall  survival, 
studies  have  also  shown  that  inflammatory  cells  can  promote 
tumor  cell  proliferation,  angiogenesis,  metastasis,  and,  hence, 
tumor  development  (15,  16).  Leukocyte  infiltration  can  influ¬ 
ence  angiogenesis  in  tumors  because  some  subsets  of  leuko¬ 
cytes,  especially  the  tumor-associated  macrophages,  can 
secrete  both  angiostatic  and  angiogenic  factors  (17,  18).  IL-1 
is  a  proinflammatory  cytokine  produced  mainly  by  monocytes 
and  macrophages.  There  are  two  IL-1  agonistic  proteins,  IL-1  a 
and  IL-lp.  IL-1  a  is  a  precursor  or  membrane-associated  mole¬ 
cule  and  is  primarily  a  regulator  of  intracellular  events  and  a 
mediator  of  local  reactions.  On  the  other  hand,  IL-lp  acts  as  a 
systemic,  hormone-like  mediator  and  is  only  active  in  a  se¬ 
creted  mature  form.  However,  once  these  two  proteins  bind 
to  their  receptors,  they  have  similar  biological  activities  (20). 
Both  IL-1  a  and  IL-lp  can  promote  tumor  angiogenesis,  but 
the  role  of  IL-lp  is  more  evident  (14).  IL-1  has  been  shown 
to  contribute  to  the  production  of  proangiogenic  factors  VEGF, 
hepatocyte  growth  factor,  tumor  necrosis  factor,  and  CXC  che¬ 
mokines  (14,  21).  Members  of  a  subfamily  of  CXC  chemokines 
sharing  a  characteristic  glutamate-leucine-arginine  (ELR)  mo¬ 
tif  near  the  NH2  terminus  of  the  molecule  are  chemoattrac¬ 
tants  for  neutrophils  and  are  important  for  wound  repair. 
The  ELR-positive  chemokines,  including  CXCL1,  CXCL2, 
CXCL3,  CXCL5,  CXCL6,  CXCL7,  and  CXCL8,  are  proangio¬ 
genic,  whereas  members  of  another  subfamily  lacking  the 
ELR  motif — ELR-negative  chemokines,  such  as  CXCL4, 
CXCL9,  CXCL10,  and  CXCL11 — are  in  general  IFN  inducible 
and  are  potential  inhibitors  of  angiogenesis.  Generally,  CXCR2 
is  the  receptor  for  angiogenic  CXC  chemokine-mediated  an¬ 
giogenesis,  and  CXCR3  is  the  receptor  for  angiostatic  IFN- 
inducible  CXC  chemokine  inhibition  of  angiogenesis  (13). 
CXC  chemokine  ligands  and  receptors  have  been  shown  to 
play  important  roles  in  mediating  non-small  cell  lung  cancer 
(NSCLC)-associated  angiogenesis  and  organ-specific  metas- 
tases  (13).  Recently,  it  has  been  reported  that  CXCL5  and 
CXCL8  protein  levels  were  elevated  in  tumor  specimens 
freshly  isolated  from  patients  with  NSCLC  and  that  these 
two  ELR-positive  CXC  chemokines  are  important  mediators 
of  angiogenesis  during  NSCLC  tumorigenesis  (22,  23).  Com¬ 
pared  with  CXCL8,  CXCL5  was  reported  to  have  a  higher 
degree  of  correlation  with  NSCLC-derived  angiogenesis  (23). 
In  a  model  system  of  human  NSCLC  tumorigenesis  in  severe 
combined  immunodeficiency  mice,  CXCL5  expression  was 
found  to  be  directly  correlated  with  tumor  growth,  tumor- 
derived  angiogenesis,  and  metastatic  potential.  Depletion  of 
CXCL5  in  this  model  system  resulted  in  attenuation  of  both 
tumor  growth  and  spontaneous  metastasis  due  to  the  inhibi¬ 
tion  of  angiogenesis  (23). 

Being  a  product  of  tumor  infiltrated  macrophages,  IL-1(3>  is 
known  to  increase  angiogenesis.  However,  in  NSCLC,  what 
angiogenic  factors  are  induced  by  IL-1  (3>  and  how  they  are 


regulated  by  IL-1(3>  are  still  not  clear.  To  elucidate  these  critical 
issues,  we  conducted  a  microarray  analysis  to  determine  the 
effect  of  IL-1(3>  on  global  gene  expression  in  the  NSCLC  ade¬ 
nocarcinoma  cell  line  A549  and  in  normal  human  tracheo¬ 
bronchial  epithelium  (NHTBE)  cells.  We  found  that  IL-1  D 
dramatically  induced  the  expression  of  an  array  of  proangio¬ 
genic  CXC  chemokine  genes  and  significantly  augmented  the 
angiogenic  activity  of  NSCLC.  In  addition,  we  found  that  tran¬ 
scription  factors  cyclic  AMP-responsive  element  binding  pro¬ 
tein  (CREB)  and  nuclear  factor  kB  (NF-kB)  both  play  critical 
roles  in  the  regulation  of  IL-1  p> -induced  CXC  chemokine  gene 
expression  and  angiogenic  activity. 

Materials  and  Methods 

Cell  cultures,  chemicals,  and  conditioned  media 

NHTBE  cells  were  purchased  from  Cambrex  and  cultured  in 
six-well  plates  as  described  previously  (24-28).  Human  umbilical  vein 
endothelial  cells  (HUVEC)  were  purchased  from  Cambrex  and  main¬ 
tained  in  EGM  complete  endothelial  growth  medium.  HUVECs  were 
used  at  passage  4.  The  human  NSCLC  cell  lines  A549,  HI 734,  H226, 
and  H2170  were  obtained  from  the  American  Type  Culture  Collection 
and  grown  in  RPMI  1640  containing  10%  fetal  bovine  serum, 
100  units/mL  penicillin,  and  100  gg/mL  streptomycin.  The  BEAS- 
2B,  1799,  1198,  and  1170-1  human  bronchial  epithelial  cell  lines  were 
obtained  from  Dr.  R.  Lotan  (The  University  of  Texas  M.D.  Anderson 
Cancer  Center,  Houston,  TX)  and  Dr.  A.  Klein-Szanto  (Fox  Chase 
Cancer  Center,  Philadelphia,  PA)  and  grown  in  Keratinocyte 
Serum-Free  Medium  (Life  Technologies,  Inc.)  containing  epidermal 
growth  factor  and  bovine  pituitary  extract  (29).  All  of  the  cells  were 
cultured  at  37°C  in  a  humidified  atmosphere  of  95%  air  and  5%  C02. 
Conditioned  media  (CM)  were  generated  as  follows:  NSCLC  cells 
(3  x  105/mL)  were  cultured  in  RPMI  1640  containing  0.5%  fetal  bovine 
serum  with  or  without  IL-1  ^  and/or  2-naphthol-AS-E-phosphate 
(KG-501)  at  different  concentrations  for  24  h.  Cell-free  supernatants 
were  collected  and  stored  at  -70°C  until  use. 

Reagents  and  antibodies 

IL-lfi,  Quantikine  CXCL5  and  CXCL8  ELISA  kits,  and  neutralizing 
antibodies  against  CXCL5,  CXCL8,  CXCR2,  and  VEGF  were  pur¬ 
chased  from  R&D  Systems.  KG-501  was  purchased  from  Sigma- 
Aldrich  and  dissolved  in  DMSO.  Antibodies  against  NF-kB  p65, 
CREB,  and  phospho-CREB  (Ser133)  were  purchased  from  Santa  Cruz 
Biotechnology,  Inc.,  and  Upstate,  respectively.  A  monoclonal  antibody 
(mAb)  against  p-actin  was  from  Sigma-Aldrich.  Transwell  chambers 
with  polyethylene  terephthalate  membranes  containing  8-gm  pores 
were  obtained  from  BD  Biosciences. 

Microarray  analysis 

After  confluence,  NHTBE  and  A549  cells  were  treated  with  control 
medium  or  the  same  medium  containing  2.5  ng/mL  IL-lp  for  8  h  be¬ 
fore  total  RNA  extraction.  Total  RNA  was  isolated  with  the  RNeasy 
Mini  Kit  (Qiagen).  The  integrity  of  mRNA  and  the  relative  rRNA  con¬ 
tamination  were  analyzed  with  the  RNA  6000  Nano  LabChip  (Agilent 
Technologies)  and  the  Agilent  2100  bioanalyzer  (Agilent  Technolo¬ 
gies).  The  RNA  from  control  group  was  amplified  and  labeled  with 
cyanine  3,  and  that  from  IL-lp-treated  cells  with  cyanine  5.  Equal 
amounts  of  the  differently  labeled  RNAs  were  then  mixed  and  hybri¬ 
dized  with  44K  whole  human  genome  oligonucleotide  microarrays 
(Agilent  Technologies).  After  hybridization,  the  arrays  were  scanned 
and  the  resulting  images  were  analyzed  using  the  Agilent  feature  ex¬ 
traction  software  program  (GE2,  version  5.91,  Agilent  Technologies). 

Quantitative  reverse  transcription-PCR 

Validation  of  the  differentially  expressed  genes  in  NHTBE  and 
NSCLC  cells  was  done  using  an  iCycler  real-time  PCR  detection 
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system  (Bio-Rad)  with  gene-specific  primers.  Primers  for  human  gly- 
ceraldehyde-3-phosphate  dehydrogenase  ( GAPDH ),  the  reference 
gene,  were  5'-TGCACCACCAACTGCTTAGC  (forward)  and  5'- 
GGCATGGACTGTGGTCATGAG  (reverse);  for  CXCL2,  5'-AGTGA- 
CAAATCCAACTGACC  (forward)  and  5'-GATGCTCAAACACAT- 
TAGGC  (reverse);  for  CXCL2,  5'-CCCAAGTTAGTTCAATCCTG 
(forward)  and  5'-TTCCTCAGCCTCTATCACAG  (reverse);  for 
CXCL3,  5'-CTTGTCTC AACCCCGCATCC  (forward)  and 
5'-TCTGGTAAGGGCAGGGACCA  (reverse);  for  CXCL5,  5'- 
TCCAATCTCCGCTCCTCCAC  (forward)  and  5 '- AGC  AGC  AGC  AG- 
CACCAACAG  (reverse);  for  CXCL6,  5'-GTTTGTCTGGACCCG- 
GAAGC  (forward)  and  5'-TCCGCTGAAGACTGGGCAAT  (reverse); 
for  CXCL8,  5'-GCATAAAGACATACTCCAAACC  (forward)  and 
5 '- ACTTCTCC  AC  A  ACCCTCTG  (reverse);  and  for  CREB,  5'-AAGCT- 
GAAAACCAACAAATGACAGTT  (forward)  and  5'-TGAACTGTC- 
TGCCCATTGG  (reverse).  Single-stranded  cDNAs  were  synthesized 
in  50  pL  of  reverse  transcription  (RT)  mix  containing  1  pg  of  total 
RNA  using  the  GeneAmp  RNA  PCR  Core  Kit  (Applied  Biosystems) 
according  to  the  manufacturer's  instructions.  PCR  analysis  was  done 
using  25-pL  volumes  with  SYBR  Green  PCR  Core  Reagents  (Applied 
Biosystems).  Primers  (200  nmol/L)  and  RT  mix  (2  pL)  were  used  in 
each  PCR.  Each  sample  was  assayed  in  triplicate  per  PCR  run  and 
the  experiment  was  repeated  thrice.  The  cycling  conditions  were  an 
initial  denaturation  at  95°C  for  10  min,  40  cycles  of  denaturation  at 
95°C  for  15  s,  and  elongation  at  60°C  for  60  s.  The  real-time  PCR  data 
were  analyzed  using  the  comparative  Ct  method. 

Transfection  of  small  interfering  RNA  against  NF-kB 

p65  and  transduction  of  lentiviral  short  hairpin  RNA 
against  CREB 

SMARTpool-sequenced  small  interfering  RNA  (siRNA)  targeting 
human  NF-kB  p65  (GenBank  accession  no.  NM_021975)  and  nonspe¬ 
cific  control  pool  siRNA  were  purchased  from  Dharmacon  RNA  Tech¬ 
nologies  and  diluted  to  20  pmol/L.  NSCLC  cells  at  50%  confluence 
were  transfected  with  siRNA  for  NF-kB  p65  or  control  siRNA  at  final 
concentrations  of  50  and  100  nmol/L  using  the  Lipofect AMINE  2000 
transfection  reagent  (Invitrogen)  according  to  the  manufacturer's  in¬ 
structions.  Seventy-two  hours  after  transfection,  the  cells  were  treated 
with  IL-lp.  for  8  h.  Total  protein  and  RNA  were  collected  from  each 
sample. 

For  CREB-targeting  viral  short  hairpin  RNA  (shRNA)  delivery,  the 
lentiviral  plasmid  pLKO.l  with  a  shRNA  clone  against  CREB  (clone 
ID:  TRCN0000011085)  was  purchased  from  Open  Biosystems.  The 
pLKO.l  plasmid  with  a  scrambled  shRNA  sequence  and  virus  packa¬ 
ging  plasmids  (psPAX2  and  pseudo-typing  plasmid  pMD2.G)  were 
obtained  from  Addgene.  HEK  293T  cells  were  obtained  from  the 
American  Type  Culture  Collection.  HEK  293T  cells  at  50%  confluence 
were  cotransfected  with  pLKO.l  and  virus  packaging  plasmids  using 
FuGENE  6  transfection  reagent  (Roche  Applied  Science).  After  trans¬ 
fection  for  16  h,  the  medium  containing  the  transfection  agent  was  re¬ 
placed  with  fresh  growth  medium.  Viral  particles  were  then  collected 
from  the  medium  every  24  h  twice.  The  virus  titer  in  the  pooled  sus¬ 
pension  was  determined  by  counting  the  puromycin-resistant  colonies 
in  the  virus-transduced  culture.  For  knockdown  of  CREB,  cells  at  50% 
confluence  were  incubated  with  viral  suspension  at  a  multiplicity  of 
infection  of  about  50  for  16  h.  Four  days  after  transduction,  the  cells 
were  treated  with  IL-lp  for  24  h  and  total  protein  and  RNA  were  col¬ 
lected  from  the  cells. 

Migration  assay 

HUVECs  (5  x  104)  were  suspended  in  serum-free  RPMI  1640  and 
seeded  in  transwell  chambers  coated  with  gelatin.  CM  from  NSCLC 
cells  were  applied  in  the  outer  chambers.  HUVECs  were  incubated  at 
37°C  in  5%  C02  for  16  h.  Following  incubation,  cells  were  fixed  in  90% 
ethanol  and  stained  with  0.1%  crystal  violet.  Nonmigrated  cells  on  the 
upper  surface  of  the  chamber  filters  were  removed  by  swabbing,  and 
the  cells  that  had  migrated  through  the  filter  were  photographed  un¬ 


der  a  microscope  and  quantified  using  the  ImageJ  software  program 
(NIH,  Bethesda,  MD).1 

Western  blot  analysis 

Western  blot  analysis  of  target  proteins  was  done  as  described  pre¬ 
viously  (30).  Equal  amounts  of  protein  (30  pg)  were  resolved  using 
10%  SDS-PAGE.  The  mouse  mAb  against  human  NF-kB  p65  was  di¬ 
luted  in  5%  nonfat  milk  at  a  ratio  of  1:200  and  rabbit  polyclonal  anti¬ 
bodies  against  human  CREB  and  p-CREB  were  diluted  at  a  ratio  of 
1:1,000  and  incubated  with  the  membranes  overnight  at  4°C.  Proteins 
reactive  with  the  primary  antibody  were  visualized  with  a  horserad¬ 
ish  peroxidase-conjugated  goat  anti-mouse  or  goat  anti-rabbit  second¬ 
ary  antibody  and  enhanced  chemiluminescence  reagents  (Amersham 
Bioscience). 

Measurement  of  CXC  chemokine  protein  secretion 

The  levels  of  secreted  CXC  chemokines  in  the  CM  were  measured 
with  immuno-dot  blotting  and  ELISA.  Immuno-dot  blotting  was  done 
as  described  before  (24,  28).  Briefly,  the  CM  were  applied  to  a  nitro¬ 
cellulose  membrane  using  the  Manifold  I  Dot-Blot  System  (Scheleicher 
&  Schuell).  The  membrane  was  then  probed  with  antihuman  CXCL5 
and  CXCL8  antibodies  and  the  target  proteins  were  detected  by  che¬ 
miluminescence  and  quantified  by  densitometry.  ELISA  was  done 
with  Quantikine  CXCL5  and  CXCL8  ELISA  kits  (R&D  Systems)  ac¬ 
cording  to  the  manufacturer's  instruction.  Both  measurements  were 
normalized  against  the  cell  number. 

Chromatin  immunoprecipitation  analysis 

Chromatin  immunoprecipitation  assay  was  done  using  EZ  ChIP 
kits  (Upstate)  according  to  the  manufacturer's  instruction.  Briefly, 
NHTBE,  BEASE-2B,  1799,  1198,  and  1170-1  cells  were  grown  in  plate 
with  normal  media  and  chromatins  were  cross-linked  by  reaction  with 
1%  formaldehyde  for  10  min.  The  cross-linked  chromatins  were  frag¬ 
mented  by  sonication  and  subsequently  immunoprecipitated  with 
anti-CREB  (Upstate)  or  anti-NF-KB  p65  (Santa  Cruz  Biotechnology). 
The  DNA  in  the  precipitate  was  purified  and  used  as  the  template 
for  PCR.  The  primers  for  CXCL5  promoter  with  NF-kB  binding  site 
were  5'-TAGAGGTGCACGCAGCTCCT  (forward)  and  5'-GAG- 
CACTGTGGCTTCCTCGT  (reverse);  for  CXCL5  promoter  with  CREB 
binding  site,  5'-CTGGACACACGTATACTTGC  (forward)  and  5'- 
GGCAGGTCATTCTAGGTTTC  (reverse);  and  for  CXCL8  promoter 
with  both  CREB  and  NF-kB  binding  sites,  5'-AAAACTTTCGTCA- 
TACTCCG  (forward)  and  5'-AAAGTTTGTGCCTTATGGAG  (re¬ 
verse).  PCR  products  were  then  separated  in  1.2%  agarose  gel  and 
stained  with  GelRed  (Biotium). 

Statistical  analysis 

Each  experiment  presented  in  the  figures  was  repeated  three  or 
more  times.  The  data  are  presented  as  the  mean  ±  SE.  Comparisons 
between  groups  were  evaluated  using  ANOVA  and  a  two-tailed  Stu¬ 
dent's  t  test.  P  <  0.05  was  considered  statistically  significant. 

Results 

Microarray  analysis  revealed  an  array  of  CXC 

chemokine  genes  up-regulated  hy  IL-1|3  in  NHTBE  and 
A549  cells 

To  identify  IL-1  (3-responsive  genes  involved  in  angiogen¬ 
esis  and  tumorigenesis  in  NSCLC,  we  used  total  RNA  iso¬ 
lated  from  NHTBE  and  A549  cells  treated  with  or  without 
2.5  ng/mL  IL-1(3  for  8  hours  to  generate  cRNA  for  micro¬ 
array  hybridization  and  analysis  using  44K  whole  human 


1  http://rsb.info.nih.gov/ij 
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Table  1.  Analysis  of  IL-1p  induction  of  CXC  chemokine  gene  expression  in  NHTBE  and  A549  cells  using 
microarray  and  real-time  PCR  analysis 


Gene  name 

Systematic  name 

Gene  description 

NHTBE  (fold 
change) 

Microarray  PCR 

A549  (fold  change) 

Microarray  PCR 

CXC  LI 

NM_00151 1 

Homo  sapiens  chemokine  (C-X-C  motif)  ligand  1 

18.8 

11.9 

32.5 

26.9 

CXCL2 

NM_002089 

Homo  sapiens  chemokine  (C-X-C  motif)  ligand  2 

18.8 

12.9 

23.3 

14.9 

CXCL3 

NM_002090 

Homo  sapiens  chemokine  (C-X-C  motif)  ligand  3 

18.2 

29.9 

19.2 

11.7 

CXCL5 

NM_002994 

Homo  sapiens  chemokine  (C-X-C  motif)  ligand  5 

173.3 

95.9 

12.0 

8.2 

CXCL6 

NM_002993 

Homo  sapiens  chemokine  (C-X-C  motif)  ligand  6 

31.3 

80.6 

13.9 

28.8 

CXCL8 

NM_000584 

Homo  sapiens  chemokine  (C-X-C  motif)  ligand  8 

11.9 

17.6 

40.8 

32.4 

VEGF 

NM_001 025366 

Homo  sapiens  vascular  endothelial  growth  factor 

2.49 

1.93 

3.2 

2.14 

genome  oligonucleotide  microarrays.  Analysis  of  the  result¬ 
ing  microarray  images  using  the  Agilent  feature  extraction 
software  program  showed  that  the  expression  of  six  CXC 
chemokine  genes,  CXCL1,  CXCL2,  CXCL3,  CXCL5,  CXCL6, 
and  CXCL8,  was  commonly  up-regulated  in  response  to 
IL-1  (3>  in  both  NHTBE  and  A549  cells  (P  <  0.001).  As  ex¬ 
pected,  we  also  saw  that  the  expression  of  another  angio¬ 
genic  gene,  VEGF ,  was  up-regulated  by  IL-1  (3>  in  both 
NHTBE  and  A549  cells  (Table  1).  To  confirm  that  these  mi¬ 
croarray  data  reflect  the  IL-l|3-induced  gene  expression,  we 
examined  the  expression  of  CXCL1,  CXCL2,  CXCL3,  CXCL5, 
CXCL6,  and  CXCL8  genes  in  NHTBE  and  A549  cells  using 
quantitative  reverse  transcrip tion-PCR  (RT-PCR).  Indeed,  the 
expression  of  all  of  these  CXC  chemokine  genes  was 
strongly  induced  after  stimulation  by  IL-1(3>,  and  the  quanti¬ 
tative  RT-PCR  results  agreed  with  the  microarray  analysis 
results  (Table  1). 

IL-lp  differentially  regulates  CXCL5  and  CXCL8 

protein  secretion  in  NHTBE  and  NSCLC  cells 

According  to  the  microarray  and  quantitative  RT-PCR  data, 
IL-1(3>  stimulated  the  gene  expression  of  angiogenic  CXC  che- 
mokines  in  both  NHTBE  and  A549  cells.  We  next  verified 
whether  such  induction  of  gene  expression  was  indeed  trans¬ 
lated  into  protein  secretion  in  these  cells.  For  this  purpose,  we 
focused  on  CXCL5  and  CXCL8  because  they  reportedly  have 
dominant  angiogenic  effects  in  lung  cells.  Using  the  immuno- 
dot  blot  assay,  we  detected  the  secretion  of  CXCL5  and 
CXCL8  in  the  CM  from  all  four  NSCLC  cell  lines  (H226, 
H2170,  A549,  and  HI 734)  treated  with  or  without  IL-1  (3> . 
However,  by  the  same  method,  the  secretion  of  CXCL8  in 
NHTBE  cells  was  detected  only  in  IL-1  |3-treated  cells,  and 
not  in  untreated  cells,  and  the  CXCL5  protein  was  not  detect¬ 
able  even  with  IL-1(3»  treatment  (data  not  shown).  We  verified 
such  differential  expression  of  angiogenic  CXC  chemokines 
between  tumor  and  normal  cells  with  ELISA  (Fig.  1A  and 
B).  Again,  we  observed  a  significantly  greater  expression  of 
CXCL5  and  CXCL8  in  tumor  cells  than  in  normal  cells,  espe¬ 
cially  with  the  induction  of  IL-1(3>.  This  result  indicated  that 
although  IL-1|3  increases  CXCL5  mRNA  level  in  both  normal 
epithelial  cells  and  lung  cancer  cells,  only  the  lung  cancer 
cells  produce  a  significant  level  of  CXCL5  chemokine  protein. 
Because  A549  cells  produced  substantial  amounts  of  both 


CXCL5  and  CXCL8,  we  used  A549  cells  as  a  model  system 
to  further  explore  the  effect  of  IL-1|3  on  angiogenesis. 

To  further  assess  the  response  of  CXCL5  and  CXCL8  gene 
expression  to  IL-1|3  treatment,  we  measured  the  time  and 
dose  effect  of  IL-1  (S  on  A549  cells.  The  level  of  CXCL5  and 
CXCL8  transcription  peaked  at  4  hours  after  IL-1(3>  stimulation 
(Fig.  2A),  indicating  that  the  cells  responded  to  IL-1  (3>  by 
rapidly  inducing  the  expression  of  CXC  chemokine  genes. 


Fig.  1.  IL-1  (3  differentially  regulates  CXCL5  and  CXCL8  protein  secretion  in 
NHTBE  and  NSCLC  cells.  NHTBE,  H226,  H2170,  HI  734,  and  A549  cells  were 
treated  with  2.5  ng/mL  IL-1  (3  for  24  h.  CM  were  collected  from  these  cell  lines. 
The  concentrations  of  CXCL5  (A)  and  CXCL8  ( B )  in  CM  were  measured  by 
ELISA.  Results  were  normalized  according  to  the  cell  number.  Columns,  mean; 
bars,  SE. 
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Fig.  2.  IL-1  (3  stimulates  CXCL5  and  CXCL8  gene  expression  and  protein  secretion  in  A549  cells  in  a  time-  and  dose-dependent  manner.  A,  A549  cells  were 
incubated  with  control  medium  or  with  medium  containing  2.5  ng/ml_  IL-1  (3  for  the  indicated  times.  Total  RNAs  were  collected  from  the  cells  and  expression  of  the 
indicated  genes  was  measured  by  quantitative  RT-PCR.  GAPDH  was  used  as  an  internal  control  for  normalizing  the  RNA  loading.  B,  A549  cells  were  incubated 
with  IL-1  (3  at  the  indicated  concentrations  for  8  h.  Total  RNAs  were  collected  from  the  cells  for  quantitative  RT-PCR.  C,  culture  media  were  collected  from  A549 
cells  with  treatment  as  in  B  and  the  concentrations  of  CXCL5  and  CXCL8  were  measured  with  ELISA.  Columns,  mean;  bars,  SE.  *,  P  <  0.05;  **,  P  <  0.01 ; 

***,  P  <  0.001 ,  compared  with  untreated  control. 


We  found  that  CXCL5  mRNA  expression  was  significantly  in¬ 
duced  even  with  IL-1|3  at  0.1  ng/mL.  CXCL8  mRNA  expres¬ 
sion  reached  a  plateau  at  2.5  to  5.0  ng/mL  of  IL-1(3>  treatment 
(Fig.  2B).  We  confirmed  this  dose-dependent  response  by  mea¬ 
suring  the  CXCL  protein  secretion  with  ELISA  (Fig.  2C). 

IL-lp  significantly  augments  the  angiogenic  activity  of 

NSCLC  by  inducing  the  expression  of  angiogenic  CXC 

chemokine  genes 

Because  IL-1(3>  up-regulates  the  expression  of  important  an¬ 
giogenic  CXC  chemokine  genes  in  NSCLC,  we  inferred  that 
IL-1  (3>  may  enhance  the  net  angiogenic  activity  of  NSCLC. 
To  test  this  possibility,  we  conducted  an  endothelial  cell 
(HUVEC)  migration  assay  with  CM  from  A549  NSCLC  cell 
lines  treated  with  or  without  IL-1|3.  As  shown  in  Fig.  3 A 
(a  and  b),  compared  with  the  CM  from  untreated  A549  cells, 
the  CM  from  IL-l|3-treated  cells  (A549/IL-1|3)  markedly  in¬ 
creased  the  angiogenic  activity  as  assessed  by  the  migration 
of  HUVECs.  Similar  results  were  observed  with  CM  prepared 
from  IL-1  |3-trea  ted  HI 734  cells  (data  not  shown).  These  data 
clearly  indicated  that  IL-1|3  significantly  augments  the  net  an¬ 
giogenic  activity  of  NSCLC  as  measured  by  the  chemotactic 
activity  of  endothelial  cell.  Next,  we  hypothesized  that  the  in¬ 
creased  migration  of  HUVECs  in  response  to  CM  from  A549/ 
IL-1(3>  cells  was  attributable  to  the  observed  increased  expres¬ 
sion  of  angiogenic  CXC  chemokine  genes  in  NSCLC  cells  in¬ 
duced  by  IL-1  (3> .  To  test  this,  we  conducted  a  HUVEC 
migration  assay  with  CM  from  A549/IL-1|3>  cells  in  the  pre¬ 
sence  of  neutralizing  antibodies  against  CXCL5,  CXCL8, 


CXCR2,  and  VEGF.  We  found  that  the  migration  of  HUVECs 
was  reduced  significantly  in  the  presence  of  neutralizing  anti¬ 
bodies  against  CXCL5  and  CXCR2  (Fig.  3 A,  d  and  f).  However, 
we  did  not  observe  a  similar  effect  on  antibodies  against 
CXCL8  and  VEGF  (Fig.  3 A,  e  and  g).  These  results  suggested 
that  the  IL-l|3-induced  angiogenic  activity  of  NSCLC  is 
mainly  attributable  to  the  induced  expression  of  CXCR2- 
dependent  CXC  chemokines,  most  likely  CXCL5. 

NF-kB  and  CREB  mediate  IL-lp-induced  CXC 

chemokine  gene  expression 

To  investigate  the  mechanism  underlying  the  IL-1  p> -induced 
CXC  chemokine  gene  expression,  we  focused  on  the  tran¬ 
scription  factors  mediating  this  effect.  Sequence  analysis  of 
the  promoters  of  these  CXC  chemokine  genes  indicated 
the  potential  binding  sites  for  NF-kB,  activator  protein  1,  ac¬ 
tivator  protein  2,  activator  protein  3,  Spl,  IFN  regulatory 
factor  1,  hepatic  nuclear  factor  1,  and  CREB  (Fig.  4 A;  refs. 
31-34).  Because  IL-1  (S  activates  NF-kB  and  CREB,  and  the 
NF-kB  site  and  CRE-like  sites  are  located  in  the  promoters 
of  CXCL1,  CXCL2,  CXCL3,  CXCL5,  and  CXCL8  genes,  we 
sought  to  determine  whether  CREB  and  NF-kB  mediate 
IL-1  |3-induced  CXC  chemokine  gene  expression  in  NSCLC 
cells.  For  this  purpose,  we  abrogated  NF-kB  and  CREB  gene 
expression  in  A549  and  HI 734  cells  by  either  transfecting 
the  cells  with  a  siRNA  for  NF-kB  p65  or  transducing  the 
cells  with  a  lentivirus  containing  the  shRNA  for  CREB 
(shCREB).  We  confirmed  that  the  NF-kB  and  CREB  protein 
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Fig.  3.  IL-1  (3  significantly  augments  the 
angiogenic  activity  of  NSCLC  by  inducing  the 
expression  of  angiogenic  CXC  chemokine  genes. 
A,  CM  were  collected  from  A549  cells  treated  with 
either  control  medium  (a)  or  medium  containing 
2.5  ng/ml_  IL-1  [3  (b)  for  24  h,  and  CM  from 
IL-1(3-treated  A549  cells  were  preincubated  with 
10  |jg/ml_  of  nonimmune  IgG  (c),  an  anti-CXCL5 
mAb  (i d ),  an  anti-CXCL8  mAb  (e),  an  anti-CXCR2 
mAb  (/),  or  an  anti-VEGF  mAb  (g)  at  37°C  for  1  h.  A 
migration  assay  was  done  by  stimulating  HUVECs 
with  the  indicated  CM  as  described  in  Materials 
and  Methods.  Migrated  cells  were  photographed 
after  16  h  of  incubation  at  37°C.  B,  image  analysis 
of  HUVEC  migration  was  quantified  using  the 
ImageJ  software  program.  Columns,  mean;  bars, 
SE.  *,  P  <  0.05. 
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levels  were  knocked  down  by  >80%  after  siRNA  transfec¬ 
tion  or  viral  transduction  in  these  two  cell  lines  (Fig.  4B). 
We  then  treated  the  cells  with  IL-1 |3  and  measured  the  CXC  che¬ 
mokine  gene  expression  levels  in  the  cells.  Real-time  PCR 
results  showed  that  IL-l|3-induced  CXC  chemokine  gene  ex¬ 
pression  significantly  decreased  after  knockdown  of  NF-kB 
(Fig.  4C).  At  the  same  time,  shCREB-transduced  cells  had  no 
or  a  much  less  significant  response  to  IL-1  |3  in  the  induction 
of  CXC  chemokine  gene  expression  when  compared  with 
the  nontransduced  control  or  scrambled  shRNA-transduced 
cells  (Fig.  4D).  These  findings  suggested  that  both  CREB  and 
NF-kB  mediate  IL-l|3-induced  CXC  chemokine  gene  expres¬ 
sion  in  NSCLC  cells. 

The  expression  level  of  CXC  chemokine  genes  is 

associated  with  the  development  of  lung  cancer 

To  investigate  if  the  angiogenic  CXC  chemokine  genes  are 
involved  in  the  development  of  lung  cancer,  we  measured 
the  expression  levels  of  CXCL5  and  CXCL8  in  an  in  vitro  lung 
carcinogenesis  model  (IVLCM)  that  includes  normal 
(NHTBE),  immortalized  (BEAS-2B  and  1799),  transformed 
(1198),  and  tumorigenic  (1170-1)  human  bronchial  epithelial 
cells.  The  quantitative  RT-PCR  results  (Fig.  5A)  showed  that 
these  two  CXC  chemokine  genes  were  differentially  expressed 
in  the  cell  lines  of  IVLCM,  with  progressive  increase  from 
NHTBE  to  BEAS-2B  and  1799  and  then  further  to  1198  and 
1170-1.  As  we  showed  that  CREB  and  NF-kB  regulate  CXC 
chemokine  gene  expression,  we  examined  whether  the  activ¬ 
ities  of  these  transcription  factors  correspond  to  the  expression 
levels  of  CXCL5  and  CXCL8  in  these  IVLCM  cells.  Western 
blot  analysis  and  quantitative  RT-PCR  data  showed  that  both 


the  activity  and  the  expression  level  of  CREB  were  gradually 
increased  with  the  progression  of  tumor  development  in  the 
IVLCM  (Fig.  5B  and  C).  A  similar  pattern  was  also  observed 
with  NF-kB  level  (data  not  shown).  Such  trend  of  change  was 
consistent  with  the  tendency  of  their  expression  levels  of 
CXCL5  and  CXCL8.  We  further  conducted  a  chromatin  immu- 
noprecipitation  assay  to  determine  whether  CREB  and  NF-kB 
regulate  the  gene  expression  of  these  two  chemokines  in  the 
IVLCM  cells.  The  results  of  the  chromatin  immunoprecipita- 
tion  assay  (Fig.  5D)  showed  that  CREB  bound  to  the  promoter 
regions  of  both  CXCL5  and  CXCL8  in  all  these  cells,  and  that 
whereas  NF-kB  also  bound  to  the  CXCL8  promoter  in  all  these 
cells,  its  binding  to  the  CXCL5  promoter  was  only  detected  in 
the  tumorigenic  (1170-1)  cells.  These  results  indicated  that  the 
expression  of  the  angiogenic  CXC  chemokine  genes  in  these 
cells  is  well  correlated  with  the  progression  of  lung  cancer 
from  normal  to  invasive  phenotype  and  might  be  further  as¬ 
sociated  with  their  inherent  activities  of  CREB  and  NF-kB. 

KG-501  inhibits  the  endothelial  cell  migration  induced 
by  CM  from  IL-1  p -treated  NSCLC  cells  via  the 
suppression  of  CXC  chemokine  gene  expression  in 
NSCLC  cells 

In  looking  for  inhibitors  that  can  block  IL-l|3>-induced  en¬ 
dothelial  cell  migration,  we  focused  on  the  agents  targeting 
the  transcription  factors  NF-kB  and  CREB.  It  has  been  re¬ 
ported  that  KG-501  is  a  small  molecule  that  binds  to  the  tran¬ 
scription  coactivator  CREB-binding  protein  (CBP)  and  blocks 
the  interaction  of  CBP  with  the  active  form  of  CREB,  phospho- 
CREB  (35).  KG-501  can  also  inhibit  NF-kB  transcription  activ¬ 
ity  because  NF-kB  also  uses  CBP  as  a  cofactor  to  regulate  gene 


www.aacrjournals.org 


321 


Cancer  Prev  Res  2008;1(5)  October  2008 


Cancer  Prevention  Research 


expression  (36).  Based  on  our  results  that  CREB  and  NF-kB 
could  mediate  IL-1  |3-induced  CXC  chemokine  gene  expres¬ 
sion  in  NSCLC  cells,  we  hypothesized  that  KG-501  could  sup¬ 
press  the  expression  of  these  CXC  chemokine  genes  and 
inhibit  the  endothelial  cell  migration  induced  by  CM  from 
NSCLC/IL-1|3  cells.  As  shown  in  Fig.  6A  and  B,  the  migration 
of  HUVECs  induced  by  CM  from  A549  cells  treated  with  IL- 
lp>  plus  10  pmol/L  of  KG-501  was  significantly  lower  than 
that  induced  by  CM  from  A549  cells  treated  with  IL-1(3>  alone 
(P  <  0.05).  Next,  we  evaluated  the  effect  of  KG-501  on  the  tran¬ 
scriptional  and  proteomic  levels  of  CXC  chemokines  induced 
by  IL-1(3>.  At  10  pmol/L,  KG-501  suppressed  the  expression  of 
all  of  the  IL-1  p> -induced  CXC  chemokine  genes  except  CXCL8 
(Fig.  6C).  For  the  protein  level,  we  measured  the  expression  of 
CXCL5  and  CXCL8  in  A549  cells.  KG-501  significantly  sup¬ 
pressed  IL-l|3-induced  CXCL5  protein  secretion.  However, 
its  effect  on  IL-1  |3-induced  CXCL8  protein  secretion  was  not 
consistent  with  its  concentration,  with  a  stimulatory  effect  at 
low  concentrations  but  a  slightly  inhibitory  effect  at  high  con¬ 
centrations  (Fig.  6D).  Similar  effects  of  KG-501  were  also  ob¬ 
served  in  the  HI 734  cell  line  (data  not  shown). 

Discussion 

In  the  present  study,  we  showed  that  the  proinflammatory 
cytokine  IL-1  (3>  up-regulates  the  expression  of  an  array  of 
proangiogenic  CXC  chemokine  genes  in  the  NSCLC  cell  lines 
and  that  both  of  the  transcription  factors  CREB  and  NF-kB  can 
mediate  this  up-regulation.  IL-1  (3>  augments  the  angiogenic 
activity  of  NSCLC,  as  manifested  by  the  ability  of  CM  from 
IL-1  |3-trea ted  cells  to  induce  endothelial  cell  migration.  Our 
finding  that  the  transcription  factors  CREB  and  NF-kB  med¬ 
iate  IL-1  |3-induced  CXC  chomokine  gene  expression  extends 
our  knowledge  about  the  mechanism  of  gene  regulation  by 
the  angiogenic  factors  and  provides  new  potential  targets  for 
angioprevention.  Finally,  the  findings  that  the  small  molecule 
KG-501  significantly  suppressed  IL-l|3-induced  CXC  chemo¬ 
kine  gene  expression  and  in  turn  reduced  the  CM-induced 
endothelial  cell  migration  indicate  that  KG-501  may  have  ther¬ 
apeutic  and  preventive  potential  for  NSCLC. 

Our  data  showing  that  IL-1(3>  up-regulates  the  expression  of 
angiogenic  CXC  chemokine  genes  and  augments  the  angio¬ 
genic  activity  of  NSCLC  are  consistent  with  previous  reports 
on  the  role  of  CXCR2  and  its  ligands  in  promoting  tumor- 
associated  angiogenesis  and  early  development  of  NSCLC 
(22,  23,  37-40).  In  an  in  vivo  study  using  murine  Lewis  lung 
cancer  heterotopic  and  orthotopic  tumor  model  systems  with 
CXCR2-/_  versus  CXCR2+/+  mice,  researchers  showed  that 
the  tumors  in  CXCR2  /  mice  exhibited  reduced  growth,  in¬ 
creased  necrosis,  inhibited  tumor-associated  angiogenesis, 
and  reduced  metastatic  potential  (37).  Similar  to  our  finding 
that  a  neutralizing  antibody  against  CXCR2  blocked  CM-in¬ 
duced  endothelial  cell  migration,  the  report  showed  that  a 
specific  neutralizing  antibody  against  CXCR2  inhibited  tumor 
growth,  increased  necrosis,  and  reduced  tumors  vessel  density 
in  CXCR2+/+  mice  (37).  Furthermore,  studies  showed  that 
CXCL5  and  CXCL8  play  a  dominant  role  in  promoting  angio¬ 
genesis  in  patients  with  NSCLC  (22,  23,  40).  Whereas  CXCL8 
was  the  first  angiogenic  ELR-positive  CXC  chemokine  discov¬ 
ered  in  NSCLC,  CXCL5  reportedly  has  a  higher  degree  of  cor¬ 
relation  with  NSCLC-derived  angiogenesis  (23).  In  our  study 


using  the  neutralizing  antibodies,  we  observed  that  CXCL5 
neutralization  inhibited  the  migration  of  endothelial  cells  to 
the  same  degree  as  did  CXCR2  neutralization.  We  failed  to 
see  this  inhibitory  effect  using  the  CXCL8-neutralizing  anti¬ 
body,  indicating  that  CXCL8  produced  by  A549  cells  in  re¬ 
sponse  to  IL-1|3  may  not  be  sufficient  to  induce  endothelial 
cell  migration.  Our  experiment  using  KG-501  further  sup¬ 
ported  such  observation,  as  this  small  molecule  blocked  en¬ 
dothelial  cell  migration  without  affecting  the  CXCL8  level  in 
CM.  In  addition,  another  angiogenic  factor,  VEGF,  may  play 
only  a  minor  role  in  inducing  endothelial  cell  migration  in 
NSCLC,  as  neutralization  of  which  could  not  inhibit  the  mi¬ 
gration  of  HUVECs.  Because  high  levels  of  CXCL5  and 
CXCL8  protein  expression  were  detected  in  IL-l|3-treated 
NSCLC  cells  but  only  a  very  low  level  of  CXCL5  protein 
was  induced  in  NHTBE  cells,  we  speculated  that  IL-1  (S  may 
induce  an  angiogenic  response  only  in  NSCLC  tumor  cells 
but  not  in  surrounding  normal  cells. 

It  is  well  documented  that  IL-1  (3>  up-regulates  the  expres¬ 
sion  of  the  proangiogenic  CXC  chemokine  genes  and  that 
NF-kB  is  the  common  transcription  factor  that  mediates  this 
effect  (31,  41,  42).  All  of  the  angiogenic  CXC  chemokine  gene 
promoters  contain  a  putative  ds-element  that  is  recognized 
by  the  NF-kB  family  of  transcriptional  factors  (31,  33,  43). 
Consistent  with  these  findings,  our  results  showed  that  after 
knockdown  of  NF-kB  p65  by  siRNA  transfection,  both  basal 
and  IL-l|3-induced  expression  of  CXC  chemokine  genes  de¬ 
creased  dramatically  in  A549  and  H1734  cells.  IL-1(3>  can  also 
regulate  gene  expression  through  the  transcription  factor 
CREB.  Previously,  we  reported  that  IL-1(3>  activates  the  mito¬ 
gen-activated  protein  kinase  (ERK1/ 2) /mitogen-  and  stress- 
activated  protein  kinase/CREB  pathway  and  regulates 
MUC5AC  gene  expression  in  human  airway  epithelial  cells 
(30).  Sequence  analysis  of  the  CXC  chemokine  gene  promo¬ 
ters  identified  a  CRE  or  CRE-like  domain  in  the  gene 
CXCL1,  CXCL2 ,  CXCL3,  CXCL5,  and  CXCL8  (31,  32,  34).  In 
the  present  study,  we  showed  that  NSCLC  cells  with  CREB 
knockdown  were  much  less  responsive  to  IL-1  (3>  than  cells 
without  such  knockdown  in  terms  of  the  induction  of  angio¬ 
genic  CXC  chemokine  gene  expression.  We  further  con¬ 
firmed  the  binding  of  CREB  to  the  CXCL5  and  CXCL8 
promoters  with  chromatin  immunoprecipitation  assay.  These 
data  suggested  that  CREB  can  also  regulate  the  expression  of 
CXC  chemokine  genes.  It  has  been  reported  that  cyclooxy¬ 
genase-2  is  critical  for  IL-l|3-induced  angiogenesis  both 
in  vitro  and  in  vivo  through  the  production  of  prostanoids 
such  as  prostaglandin  E2  and  thromboxane  A2  (44).  In  addi¬ 
tion,  Pold  et  al.  (40)  reported  that  cyclooxygenase-2  con¬ 
tributes  to  the  progression  of  NSCLC  tumorigenesis  by 
enhancing  the  expression  of  CXCL5  and  CXCL8.  Because  cy¬ 
clooxygenase-2  expression  is  also  regulated  by  CREB  in 
many  cell  types,  including  lung  cell  lines  (45-48),  CREB 
may  regulate  CXC  chemokine  gene  expression  through 
multiple  mechanisms  that  need  to  be  further  delineated.  In 
addition,  the  interaction  between  CREB  and  NF-kB  in  regu¬ 
lating  the  expression  of  these  chemokine  genes  should  be 
further  investigated  because  these  two  transcription  factors 
may  compete  with  each  other  for  the  binding  to  CBP  or 
work  synergistically  to  affect  the  outcome  of  gene  regulation 
(49).  Additionally,  the  response  elements  for  CREB  and 
NF-kB  on  the  promoter  of  CXCL8  gene  are  consecutively 
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Fig.  4.  NF-kB  and  CREB  mediate  IL-1  (3-induced  CXC  chemokine  gene  expression.  A,  transcription  factor  binding  domains  located  in  the  gene  promoters  of  CXCL1, 
CXCL2,  CXCL3,  CXCL5,  and  CXCL8.  B,  A549  cells  were  transfected  with  siRNA  for  NF-kB  p65  or  transduced  with  shCREB  lentivirus;  nontransduced  cells  and 
cells  transfected  with  control  siRNA  or  transduced  with  scrambled  shRNA  were  used  as  controls.  Proteins  were  extracted  from  the  cells  72  h  after  transfection  or 
96  h  after  transduction.  NF-kB  and  CREB  protein  expression  levels  were  detected  by  Western  blot  analysis.  (3-Actin  protein  was  probed  as  a  loading  control. 

C,  A549  and  HI  734  cells  were  transfected  with  siRNA  for  NF-kB  p65,  and  cells  transfected  with  control  siRNA  were  used  as  control.  Seventy-two  hours  later,  the  cells 
were  treated  with  either  control  medium  or  medium  containing  2.5  ng/ml_  IL-1  (3  for  8  h. 
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Fig.  4  Continued.  D,  A549  and  HI  734  cells  were  transduced  with  shCREB,  and  nontransduced  cells  and  cells  transduced  with  scrambled  shRNA  were  used  as 
controls.  Ninety-six  hours  later,  the  cells  were  treated  with  or  without  2.5  ng/mL  IL-1  (3  for  24  h.  Total  RNAs  were  isolated  from  the  cells  in  C  and  D.  Real-time  PCR  was 
done  with  specific  primers  for  the  indicated  genes.  Columns,  mean;  bars,  SE.  *,  P  <  0.05;  **,  P  <  0.01 ;  ***,  P  <  0.001 . 


located,  which  further  complicated  the  interaction  of  these 
two  transcription  factors  and  might  have  also  contributed 
to  the  perplexing  pattern  of  CXCL8  expression  in  response 
to  KG-501  treatment. 

In  the  IVLCM  cell  lines,  we  detected  that  the  expression  of 
the  angiogenic  CXC  chemokine  genes  CXCL5  and  CXCL8  in¬ 
creased  progressively  from  normal  (NHTBE),  immortalized 
(BEAS-2B  and  1799),  and  transformed  (1198)  to  tumorigenic 
(1170-1)  human  bronchial  epithelial  cells.  This  is  not  unex¬ 
pected  because  the  angiogenic  CXC  chemokines  CXCL5  and 
CXCL8  have  been  reported  to  be  elevated  in  NSCLC  tissues 
and  their  expression  is  related  to  tumor  progression  (23,  50, 
51).  Recently,  accumulating  evidence  shows  that  CXC  chemo¬ 
kines  induce  tumorigenesis  by  stimulating  cell  proliferation, 
mediating  cell  survival,  promoting  angiogenesis,  and  facilitat¬ 


ing  tumor  cell  migration  and  invasion  (13,  52).  In  addition,  we 
detected  that  the  expression  and  activation  of  CREB  increased 
correspondingly  with  the  tumorigenicity  in  these  IVLCM  cell 
lines.  The  consistency  of  the  expression  and  activation  of  these 
factors  with  the  expression  of  angiogenic  CXC  chemokine 
genes  in  IVLCM  confirmed  the  regulation  of  CXC  chemokine 
genes  by  CREB  and  NF-kB,  and  lent  support  to  their  involve¬ 
ment  in  lung  carcinogenesis.  CREB  has  been  known  for  its  role 
in  cell  proliferation  and  survival  (53-55).  We  have  recently  re¬ 
ported  that  the  basal  activity  and  expression  level  of  CREB  are 
commonly  higher  in  a  number  of  NSCLC  cell  lines  versus  nor¬ 
mal  cells,  and  the  inhibition  of  CREB  transcription  activity  in¬ 
duces  apoptosis  in  these  NSCLC  cells  by  suppressing  the 
expression  of  CREB-regulated  genes  that  are  involved  in  cell 
proliferation  (56).  Moreover,  we  have  accumulated  data  from 
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archived  tumor  tissue  specimens  showing  that  the  CREB  and 
p-CREB  levels  are  commonly  higher  in  the  lung  tumor  tissues 
versus  the  adjacent  normal  tissues  (57).  By  showing  the  role  of 
CREB  in  tumor  angiogenesis,  the  current  study  further  sug¬ 
gests  that  CREB  can  be  an  effectual  target  for  therapy  as  well 
as  prevention  of  NSCLC. 

KG-501  is  a  small  molecule  that  binds  to  the  KIX  domain  of 
CBP  (35).  It  disrupts  the  interaction  of  CBP  with  CREB  and 
inhibits  the  CREB-dependent  activation  of  cellular  genes. 
KG-501  can  also  disrupt  the  interaction  of  other  factors  with 
CBP,  such  as  NF-kB  (35).  Our  finding  that  KG-501  reduced  the 
NSCLC /IL-1|3  conditioned  medium-induced  migration  of 
HUVECs  by  down-regulating  the  expression  of  the  ELR- 
positive  CXC  chemokine  genes  in  NSCLC  cells  indicates  that 
KG-501  can  be  used  as  a  therapeutic  and/or  preventive  agent 
for  inhibiting  tumor-associated  angiogenesis  in  NSCLC. 
Although  the  CXCL8  promoter  contains  both  a  NF-kB- 
binding  site  and  a  CRE-like  motif,  and  our  knockdown  ex¬ 
periments  show  that  the  depletion  of  either  factor  reduced 
IL-1  (3> -induced  CXCL8  expression  (Fig.  4C  and  D),  KG-501 
was  not  able  to  effectively  inhibit  CXCL8  gene  expression 
and  protein  secretion.  Because  KG-501  disrupts  CREB-CBP 
or  NF-kB-CBP  interaction,  the  regulation  of  CXCL8  gene 
by  these  factors  may  be  mediated  via  a  CBP-independent 
mechanism.  A  recent  study  showed  that  the  transcriptional 
activity  of  CREB  on  CXCL8  promoter  requires  a  different 
coactivator,  termed  transducer  of  regulated  CREB  (TORC1; 
ref.  34),  suggesting  a  different  regulatory  mechanism  beyond 
the  binding  of  CREB  to  CRE.  Nevertheless,  CXCL8  did  not 
seem  to  play  a  critical  role  in  the  induction  of  HUVEC  migra¬ 


tion  as  evidenced  by  the  results  of  CXCL8  neutralization; 
therefore,  KG-501  can  still  effectively  inhibit  NSCLC/IL-1|3 
CM-induced  HUVEC  migration. 

Our  findings  also  implicated  a  positive  association  be¬ 
tween  chronic  obstructive  pulmonary  disease  (COPD)  and 
lung  cancer.  Chronic  obstructive  pulmonary  disease  is  a  pro¬ 
duct  of  chronic  inflammation  that  leads  to  tissue  damage  and 
physiologic  adaptations  (58,  59).  It  has  been  known  for  years 
that  local  inflammation  in  the  lungs  plays  an  important  role 
in  airway  remodeling  and  parenchymal  destruction,  which 
are  effects  typified  by  chronic  obstructive  pulmonary  disease 
(59).  It  is  now  well  recognized  that,  in  addition  to  lung  in¬ 
flammation,  patients  with  chronic  obstructive  pulmonary 
disease  frequently  show  persistent  low-grade  systemic  in¬ 
flammation,  with  the  characteristic  release  of  proinflamma- 
tory  mediators  such  as  IL-1  (3>  and  tumor  necrosis  factor  a 
into  the  circulation  (60).  Considerable  evidence  has  associated 
chronic  inflammation  with  cancer  development.  Our  finding 
that  CREB  and  NF-kB  regulate  proangiogenic  CXC  chemokines 
in  response  to  proinflammatory  cytokine  IL-1(3>  may  provide  a 
novel  mechanistic  linkage  between  chronic  obstructive  pul¬ 
monary  disease  and  the  development  of  lung  cancer. 

In  summary,  IL-1|3  increases  the  angiogenic  activity  of 
NSCLC  by  up-regulating  the  expression  of  an  array  of  proan¬ 
giogenic  CXC  chemokine  genes,  which  subsequently  induces 
endothelial  cell  migration.  The  transcription  factors  CREB  and 
NF-kB  both  can  mediate  this  effect,  suggesting  that  these 
two  transcription  factors  are  involved  in  tumor-associated 
angiogenesis  and,  therefore,  could  be  potential  targets  for 
the  angioprevention  in  NSCLC.  We  also  conclude  that  the 
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Fig.  5.  The  expression  of  CXC  chemokine  genes  and  CREB  is  associated  with  the  development  of  lung  cancer.  A  and  B,  total  RNAs  from  NHTBE  and 
subconfluent  cultures  of  BEAS-2B,  1799,  1198,  and  1170-1  were  extracted  and  the  expressions  of  designated  genes  were  quantitated  with  real-time  PCR  using 
specific  primers.  Each  specific  gene  expression  was  normalized  with  the  expression  level  of  GAPDH.  Columns,  mean;  bars,  SE.  C,  whole-cell  lysates  (20  pg/lane) 
were  isolated  from  NHTBE  and  IVLCM  cell  lines,  and  the  levels  of  phospho-CREB  (p-CREB)  and  total  CREB  were  measured  by  immunoblot  analysis.  D,  CREB 
and  NF-kB  binding  on  CXCL5  and  CXCL8  promoter.  Chromatin  immunoprecipitation  assay  was  done  with  chromatins  prepared  from  NHTBE,  BEASE-2B,  1799, 1198, 
and  1170-1  cells.  The  binding  of  CREB  or  NF-kB  to  the  CXCL  promoter  was  detected  by  visualization  of  the  PCR  product.  The  single  bands  detected  in 
input  samples  indicate  the  specificity  of  the  PCR  primers. 
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Fig.  6.  KG-501  suppresses  NSCLC/IL-lp  CM-induced  migration  of  HUVECs  by  regulating  IL-1(3-induced  CXC  chemokine  gene  expression  in  NSCLC  cells.  A, 
HUVEC  migration  was  induced  with  CM  from  A549  cells  treated  with  control  medium  (a)  or  with  medium  containing  2.5  ng/mL  IL-ip  (b),  10  pmol/L  KG-501  (c), 

2.5  ng/mL  IL-1  p  plus  2.5  pmol/L  KG-501  (d),  2.5  ng/mL  IL-1  p  plus  5  pmol/L  KG-501  (e),  or  2.5  ng/mL  IL-1  p  plus  1 0  pmol/L  KG-501  (f)  in  serum-free  medium  for  24  h. 
B,  quantification  of  image  analysis  for  the  HUVEC  migration  assay.  C,  total  RNAs  were  isolated  from  cells  with  the  treatment  described  in  A,  and  real-time  PCR 
was  done  to  measure  the  expression  level  of  the  indicated  genes.  D,  the  concentration  of  CXCL5  and  CXCL8  protein  in  the  CM  was  measured  with  ELISA.  Results 
were  normalized  according  to  the  cell  number.  Columns,  mean;  bars,  SE.  **,  P  <  0.01 ;  ***,  P  <  0.001 . 
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small  molecule  KG-501  neutralizes  the  effect  of  NSCLC /IL- 
1  (^-conditioned  medium  on  endothelial  cell  migration  by  inhi¬ 
biting  CREB  and  NF-kB  transcriptional  activity,  which  results 
in  the  down-regulation  of  CXC  chemokine  gene  expression  in 
NSCLC  cells,  suggesting  that  KG-501  may  be  used  as  a  ther¬ 
apeutic  and  angiopreventive  agent  for  NSCLC. 
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Epidermal  Growth  Factor  Receptor  Abnormalities  in  the  Pathogenesis  and 
Progression  of  Lung  Adenocarcinomas 

Ximing  Tang,1  Marileila  Varella-Garcia,3  Ana  C.  Xavier,3  Erminia  Massarelli,1  Natalie  Ozburn,1 
Cesar  Moran2  and  Ignacio  I.  Wistuba1,2 


Abstract  To  identify  the  characteristics  and  sequence  of  epidermal  growth  factor  receptor  (EGFR) 
abnormalities  relevant  to  the  pathogenesis  and  progression  of  lung  adenocarcinoma,  we  per¬ 
formed  a  precise  mapping  analysis  of  EGFR  mutation,  gene  copy  number,  and  total  and  phos- 
phorylated  EGFR  protein  expression  for  the  same  tissue  sites.  We  examined  normal  bronchial 
and  bronchiolar  epithelium  (NBE)  and  tumor  tissues  obtained  from  50  formalin-fixed  lung 
adenocarcinomas,  including  24  EGFF-mutant  primary  tumors  with  nine  corresponding  lymph 
node  metastases  and  26  wild-type  primary  tumors.  NBE  in  12  of  24  (50%)  mutant  and  3  of  26 
(12%)  wild-type  tumors  harbored  EGFR  mutations;  these  NBE  also  showed  a  lack  of  EGFR 
copy  number  increase  and  frequent  EGFR  (69%)  and  phosphorylated  EGFR  (33%)  overex¬ 
pression.  EGFR  mutation  and  protein  overexpression  were  more  frequent  in  NBE  sites  within 
tumors  than  in  NBE  sites  adjacent  to  and  distant  from  tumors,  suggesting  a  localized  field 
effect.  Sites  with  high  and  low  EGFR  copy  numbers  were  heterogeneously  distributed  in  six 
of  nine  primary  tumors  and  in  one  of  eight  metastases.  EGFR  protein  overexpression  was  sig¬ 
nificantly  higher  in  metastasis  sites  than  in  primary  tumors.  We  conclude  from  our  findings  that 
EGFR  mutations  and  protein  overexpression  are  early  phenomena  in  the  pathogenesis  of  lung 
adenocarcinoma  and  that  EGFR  mutation  precedes  an  increase  in  gene  copy  number.  In 
EGFF-mutant  adenocarcinoma  metastases,  the  higher  levels  of  EGFR  overexpression  and 
more  homogeneously  distributed  high  gene  copy  numbers  suggest  tumor  progression.  Our 
findings  have  important  implications  for  the  development  of  new  strategies  for  targeted  che- 
moprevention  and  therapy  in  lung  adenocarcinoma  using  EGFR  inhibitors. 


Epidermal  growth  factor  receptor  (EGFR),  a  tyrosine  kinase 
(TK)  member  of  the  ErbB  family,  has  shown  frequent  abnorm¬ 
alities  in  non-small  cell  lung  carcinomas.  These  abnormalities 
include  protein  overexpression,  gene  amplification,  and  muta¬ 
tion  (1-3).  Somatic  EGFR  mutations  have  been  identified  in 
specific  subsets  of  patients  with  lung  adenocarcinoma,  includ¬ 
ing  never  or  light  smokers,  women,  and  patients  of  East  Asian 
descent  (4).  The  mutations  cluster  in  the  first  four  exons 
(18-21)  of  the  TK  domain  of  the  gene,  and  -90%  of  the  muta¬ 
tions  are  composed  of  either  an  in-frame  deletion  in  exon  19  or 
a  specific  missense  mutation  in  exon  21  (4).  An  increase  in 
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EGFR  gene  copy  number,  including  high  polysomy  and  gene 
amplification  shown  by  fluorescent  in  situ  hybridization 
(FISH),  has  been  detected  in  22%  of  patients  with  surgically 
resected  (stages  I-IIIA)  non-small  cell  lung  carcinomas  and 
correlated  with  EGFR  protein  overexpression  (2).  Higher  fre¬ 
quencies  (40-50%)  of  EGFR  high  copy  number  have  been  re¬ 
ported  in  patients  with  advanced  non-small  cell  lung 
carcinomas  (5-10).  Despite  this  knowledge,  limited  informa¬ 
tion  is  available  on  the  role  of  EGFR  abnormalities  in  the  early 
pathogenesis  and  progression  of  lung  adenocarcinomas. 

Recently,  we  showed  that  mutation  of  the  EGFR  TK  domain 
is  an  early  event  in  the  pathogenesis  of  lung  adenocarcinoma 
and  is  detected  in  histologically  normal  bronchial  and  bronch¬ 
iolar  epithelium  (NBE)  in  43%  of  patients  with  EGFR-mutant 
tumors  (11).  We  found  that  EGFR  mutations  were  more  fre¬ 
quent  in  normal  epithelium  within  the  tumor  (43%)  than  in 
adjacent  sites  (24%),  suggesting  a  localized  field  effect  (11). 
However,  no  comprehensive  information  is  available  regard¬ 
ing  the  role  of  EGFR  abnormalities,  including  gene  mutation, 
increased  copy  number,  and  protein  overexpression  in  the 
early  pathogenesis  and  progression  of  lung  adenocarcinomas. 

Both  EGFR  gene  mutations  and  high  copy  number 
(gene  amplification  and  high  polysomy  identified  by  FISH) 
have  been  associated  with  sensitivity  to  the  small-molecule 
TK  inhibitors  gefitinib  and  erlotinib  in  patients  with  lung 
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adenocarcinoma  (5-18).  However,  some  of  these  results  have 
been  rather  controversial  (9,  10,  19,  20).  In  these  studies  of  ge- 
fitinib  and  erlotinib,  most  of  the  EGFR  mutation  and  copy 
number  analyses  were  done  in  very  small  tissue  samples  or 
in  cytologic  specimens  obtained  from  primary  tumor  and  me¬ 
tastasis  sites  in  patients  with  advanced-stage  lung  cancer  (5-9, 
12-16).  To  date,  no  studies  have  been  done  to  identify  the  char¬ 
acteristics  of  EGFR  gene  and  protein  expression  abnormalities 
at  different  sites  with  respect  to  primary  lung  adenocarcino¬ 
mas  and  in  corresponding  sites  of  metastasis,  information  that 
might  resolve  some  of  the  controversy. 

To  identify  the  sequence  of  EGFR  abnormalities  involved  in 
the  pathogenesis  and  progression  of  lung  adenocarcinoma,  we 
did  a  precise  mapping  analysis  correlating  EGFR  mutation, 
gene  copy  number,  and  protein  expression  in  NBE  fields,  pri¬ 
mary  tumors,  and  corresponding  lymph  node  metastases  that 
were  obtained  from  50  patients  with  lung  adenocarcinomas, 
including  24  patients  with  EGFR-mutant  primary  tumors  with 
nine  corresponding  lymph  node  metastasis  sites  and  26  pa¬ 
tients  with  E GFR-wild-type  primary  tumors. 

Materials  and  Methods 

Case  selection 

To  map  EGFR  gene  and  protein  expression  abnormalities,  we 
obtained  formalin-fixed,  paraffin-embedded  lung  adenocarcinoma  tis¬ 
sue  specimens  from  the  Lung  Cancer  Specialized  Program  of  Research 
Excellence  Tissue  Bank  at  The  University  of  Texas  M.  D.  Anderson 
Cancer  Center  (Houston,  TX).  The  tumor  tissue  specimens  came  from 
50  patients  with  surgically  resected  lung  adenocarcinomas  (tumor- 
node-metastasis  stage  I-IIIA)  with  known  EGFR  mutations  in  exons 
18  to  21,  as  described  previously  (3,  11).  This  bank  was  approved  by 
the  M.  D.  Anderson  Cancer  Center  Institutional  Review  Board. 

Of  these  50  patients,  24  patients  had  lung  adenocarcinoma  with 
EGFR  mutations  in  exon  18  (n  =  1),  exon  19  ( n  =  13),  and  exon  21 
(n  =  10),  and  26  patients  had  EGFR-wild-type  lung  adenocarcinoma. 
The  patients'  clinicopathologic  features  are  summarized  in  Table  1.  All 
lung  adenocarcinomas  were  of  mixed  histologic  subtype  (WHO  clas¬ 
sification;  ref.  21).  None  of  the  patients  had  received  cytotoxic  and/or 
targeted  therapy.  Clinical  staging  was  based  on  the  revised  Interna¬ 
tional  System  for  Staging  Lung  Cancer  (22). 

EGFR  abnormality  mapping 

We  retrospectively  reviewed  H&E-stained  histology  sections  of  pri¬ 
mary  tumor,  lymph  node  metastases,  and  adjacent  normal  lung  tissue 
specimens  to  identify  tissue  foci  available  for  EGFR  abnormality  ana¬ 
lyses.  The  EGFR  abnormalities  included  EGFR  mutations  in  exons  18 
and  21,  as  shown  by  microdissection  and  PCR-based  sequencing; 
EGFR  copy  number,  as  shown  by  FISH;  and  total  EGFR  and  phos- 
phorylated  EGFR  (pEGFR),  as  shown  by  immunohistochemical  ana¬ 
lyses.  Representative  examples  of  these  molecular  changes  are 
illustrated  in  Fig.  1. 

We  used  serial  5-gm-thick  histology  sections  for  the  tissue  microdis¬ 
section,  FISH,  and  immunohistochemical  analyses.  We  identified  a 
total  of  316  noncontiguous  tumor  and  epithelial  foci  from  among  142 
NBE  specimens  (obtained  from  50  patients;  2.84  sites/ patient),  144  pri¬ 
mary  tumors  (from  50  patients;  2.88  sites/ patient),  and  30  lymph  node 
metastases  (from  9  patients;  3.3  sites /patient).  We  examined  NBE  and 
primary  tumors  in  both  EGFR-mutant  and  EGFR-wild-type  cases  and 
metastasis  sites  in  EGFR-mutant  cases  only.  All  epithelial  foci  consisted 
of  normal  or  mildly  hyperplastic  epithelia  that  harbored  small  bronchi 
(65  sites)  and  bronchioles  (77  sites). 

The  NBE  specimens  were  obtained  from  three  different  locations 
based  on  their  relationship  to  the  tumors:  within  the  tumor  (47  sites), 
<5  mm  from  the  tumor  margin  (adjacent  to  tumor;  63  sites),  and 


Table  1.  Clinicopathologic  features  of  patients  with 
lung  adenocarcinomas  examined  for  EGFR 
abnormalities  in  tumors  and  adjacent  normal 
epithelium 


Features/samples 

EGFR  status 

Mutant 
(n  =  24) 

Wild-type 
(n  =  26) 

Total 
(#7  =  50) 

Mean  age  (y) 

61.3 

62.7 

62.1 

Gender 

Female 

19  (79%) 

13  (50%) 

32 

Male 

5  (21  %) 

13  (50%) 

18 

Ethnicity 

East  Asian 

13  (54%) 

9  (35%) 

22 

Not  East  Asian 

11  (56%) 

17  (65%) 

28 

Smoking  history 

Never 

16  (67%) 

9  (35%) 

25 

Former 

7  (29%) 

10  (38%) 

17 

Current 

1  (4%) 

7  (27%) 

8 

Stage  of  disease 

1 

11  (46%) 

15  (58%) 

26 

II 

5  (21  %) 

4  (15%) 

9 

IIIA 

8  (33%) 

7  (27%) 

15 

>5  mm  from  the  tumor  margin  ("distant"  lung;  32  sites).  We  did  not 
detect  squamous  metaplastic  or  dysplastic  lesions  in  the  bronchial 
structures  or  atypical  adenomatous  hyperplasias  in  the  alveolar  tis¬ 
sue.  We  identified  small  bronchi  on  the  basis  of  well-defined  smooth 
muscle  and  discontinuous  cartilage  layers.  Bronchioles  were  defined 
as  small  conducting  airways  lacking  well-defined  smooth  muscle 
wall  or  cartilage  layers.  We  assessed  the  location  of  the  small  bron¬ 
chial  and  bronchiolar  respiratory  epithelium  examined  for  EGFR  ab¬ 
normalities  based  on  the  epithelia's  location  in  relation  to  the  tumor 
tissue  in  the  corresponding  histology  sections,  as  previously  de¬ 
scribed  (11). 

Microdissection  and  DNA  extraction 

Approximately  1,000  cells  were  precisely  microdissected  from 
8-gm-thick,  H&E-stained,  formalin-fixed,  paraffin-embedded  histol¬ 
ogy  sections  for  each  site  using  laser  capture  microdissection  (Arc- 
turus  Engineering  Laser  Capture  Microdissection  System;  MDS 
Analytical  Technologies),  as  previously  described  (11).  To  prevent 
the  nonspecific  binding  of  the  mutant  cells  to  the  microdissection 
cap  film,  the  microdissected  tissue  samples  were  redissected  from 
the  film  under  stereomicroscope  visualization  using  fine  needles 
(25-gauge  5/8-inch  needles).  We  then  extracted  the  DNA  using 
25  gL  of  PicoPure  DNA  Extraction  solution  containing  proteinase  K 
and  incubated  the  DNA  at  65°C  for  20  h.  Subsequently,  proteinase  K 
was  inactivated  by  heating  samples  at  95°C  for  10  min. 

EGFR  mutation  analysis 

Exons  18  and  21  of  EGFR  were  PCR-amplified  using  DNA  ex¬ 
tracted  from  microdissected  NBE  and  tumor  cells,  as  previously  de¬ 
scribed  (3,  11).  Each  PCR  was  done  using  HotStarTaq  Master  Mix 
(Qiagen)  for  40  cycles  at  94°C  for  30  s,  63°C  for  30  s,  and  72°C  for 
30  s,  followed  by  a  7-min  extension  at  72°C.  PCR  products  were 
directly  sequenced  using  the  Applied  Biosystems  PRISM  dye  termi¬ 
nator  cycle  sequencing  method  (Perkin-Elmer  Corp.).  We  confirmed 
all  sequence  variants  by  independent  PCR  amplifications  from  at 
least  two  independent  microdissections  and  sequenced  the  variants 
in  both  directions. 
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EGFR  FISH  analysis 

We  analyzed  the  gene  copy  number  per  cell  using  the  LSI  EGFR 
SpectrumOrange/CEP  7  SpectrumGreen  Probe  (Abbott  Molecular), 
as  previously  described  (5).  Histology  sections  were  incubated  at 
56°C  overnight  and  deparaffinized  by  washing  in  CitriSolv  (Fisher 
Scientific).  After  incubation  in  2x  SSC  buffer  (pH  7.0)  at  75°C  for  15 
to  25  min,  the  histology  sections  were  digested  with  proteinase  K 
(0.25  mg/mL  in  2x  SSC)  at  37°C  for  15  to  25  min,  rinsed  in  2x  SSC 
(pH  7.0)  at  room  temperature  for  5  min,  and  dehydrated  using  ethanol 
in  a  series  of  increasing  concentrations  (70%,  85%,  100%).  We  applied 
the  EGFR  SpectrumOrange/CEP  7/ SpectrumGreen  probe  set  (Abbott 
Molecular)  onto  the  selected  area,  according  to  the  manufacturer's 
instructions,  on  the  basis  of  the  tumor  foci  seen  on  each  slide.  We  then 
covered  the  hybridization  area  with  a  glass  coverslip  and  sealed  the 
coverslip  with  rubber  cement.  The  slides  were  incubated  at  80°C  for 
10  min  for  codenaturation  of  chromosomal  and  probe  DNA  and  then 
placed  in  a  humidified  chamber  at  37°C  for  20  to  24  h  to  allow  hybri¬ 
dization  to  occur.  Posthybridization  washes  were  done  in  1.5  mol/L  of 
urea  and  0.1  x  SSC  (pH  7.0-7.5)  at  45°C  for  30  min  and  in  2x  SSC  for 
2  min  at  room  temperature.  After  the  samples  were  dehydrated  in  a 
series  of  increasing  ethanol  concentrations,  4',6/-diamidino-2-pheny- 
lindole  (0.15  mg/mL  in  Vectashield  Mounting  Medium;  Vector 
Laboratories)  was  applied  for  chromatin  counterstaining.  FISH  analy¬ 
sis  was  done  independently  by  two  authors  (M.  Varella-Garcia  and  A. 
C.  Xavier),  who  were  blinded  to  the  patients'  clinical  characteristics 
and  all  other  molecular  variables.  Patients  were  classified  into  six 
FISH  strata  according  to  the  frequency  of  cells  with  the  EGFR  gene 


copy  number  and  referred  to  the  chromosome  7  centromere,  as  fol¬ 
lows:  (a)  disomy  (  >3  copies  in  <10%  of  cells);  ( b )  low  trisomy  (3  copies 
in  10%  to  40%  of  the  cells,  >4  copies  in  <10%  of  cells);  (c)  high  trisomy 
(3  copies  in  >40%  of  cells,  >4  copies  in  <10%  of  cells);  ( d )  low  polys- 
omy  (>4  copies  in  10-40%  of  cells);  ( e )  high  polysomy  (>4  copies  in 
>40%  of  cells);  and  (f)  gene  amplification  (ratio  of  EGFR  gene  to  chro¬ 
mosome  >2,  presence  of  tight  EGFR  gene  clusters  and  15  copies  of 
EGFR  per  cell  in  10%  of  the  analyzed  cells).  The  high  polysomy  and 
gene  amplification  categories  were  considered  to  be  high  EGFR  copy 
number,  and  the  other  categories  were  considered  to  be  nonincreased 
EGFR  copy  number,  as  previously  published  (5).  Analysis  was  done  in 
approximately  50  nuclei  per  tumor  and  epithelial  site,  and  the  section 
of  the  area  was  guided  by  image  captured  in  the  H&E-stained  section. 

Immunohistochemical  staining 

Tissue  histology  sections  for  immunohistochemical  analyses  were 
deparaffinized,  hydrated,  heated  in  a  steamer  for  10  min  with 
10  mmol/L  of  sodium  citrate  (pH  6.0)  for  antigen  retrieval,  and 
washed  in  Tris  buffer.  Peroxide  blocking  was  done  with  3%  H202 
in  methanol  at  room  temperature  for  15  min,  followed  by  10%  bovine 
serum  albumin  in  TBS  with  Tween  20  for  30  min  at  room  tempera¬ 
ture.  For  the  EGFR  analysis,  tissue  sections  were  incubated  for  2  h 
with  primary  antibodies  against  the  EGFR  clone  31G7  (1:100  dilution; 
Zymed)  and  pEGFR  Tyr  1086  (1:100  dilution;  Invitrogen).  Tissue  sec¬ 
tions  were  then  incubated  for  30  min  with  the  secondary  antibody 
(EnVision+  Dual  Link  labeled  polymer;  DAKO),  after  which  diamino- 
benzidine  chromogen  was  applied  for  5  min.  The  slides  were  then 
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Fig.  1.  A  representative  case  of  EGFR-mutant  lung  adenocarcinoma:  EGFR  gene  and  protein  expression  abnormalities  in  NBE  ( A-E ),  primary  tumor  ( F-J ),  and 
lymph  node  metastasis  ( K-O )  sites.  Histologic  characteristics  [A,  F,  and  K)  of  tissue  sections  stained  with  H&E  (magnification,  x100).  PCR-based  EGFR  sequencing 
( B ,  G  and  L)  of  the  same  EGFR  mutation  in  exon  21  (L858R,  black  arrowhead)  in  NBE  (B),  primary  tumor  (G),  and  lymph  node  metastasis  sites  ( L ).  EGFR  FISH  analysis 
(C,  H  and  M)  of  low  trisomy  (low  copy  number)  in  the  NBE  sample  (C),  high  polysomy  in  the  primary  tumor  site  (H),  and  gene  amplification  ( M)  in  the  metastasis  site. 
Immunohistochemical  analysis  (D,  I,  N,  E,  J,  and  O)  of  high  EGFR  and  pEGFR  expression  in  the  membrane  and  cytoplasm  in  all  three  types  of  samples. 
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Table  2.  Frequency  of  EGFR  gene  mutation  and  protein  overexpression  in  histologically  normal  bronchial  and 
bronchiolar  epithelium  obtained  from  EGFR-mutant  and  wild-type  lung  adenocarcinomas 


EGFR  abnormality  in  NBE 

Cases 

Sites 

Mutant 

Wild -type 

Total 

Mutant 

Wild-type 

Total 

Mutation  by  sequencing 

Number 

24 

26 

50 

85 

57 

142 

Mutant 

12  (50%)* 

3  (12%)* 

15  (30%) 

22  (26%) 

8  (14%) 

30  (21%) 

Protein  overexpression  by  immunohistochemistryt 

Number 

23 

26 

49 

78 

56 

134 

EGFR 

19  (83%) 

15  (58%) 

34  (69%) 

52  (67%) 

35  (63%) 

87  (65%) 

pEGFR 

10  (44%) 

6  (23%) 

16  (33%) 

24  (31  %) 

12  (21%) 

36  (27%) 

*P  =  0.003. 

tPositive  immunohistochemical  overexpression  score  >200  (range,  0-400). 


counterstained  with  hematoxylin  and  topped  with  a  coverslip.  For 
EGFR  and  pEGFR  expression,  antibody  specificity  was  confirmed 
using  blocking  peptide  and  phosphatase  incubation  experiments. 
For  the  control  experiments,  we  used  formalin-fixed  and  paraffin-em¬ 
bedded  pellets  from  lung  cancer  cell  lines  with  confirmed  EGFR  and 
pEGFR  overexpression.  Thyroid  transcription  factor-1  (TITF-1)  anti¬ 
body  (1:100  dilution.  Cell  Marque)  was  used  for  the  identification 
of  TITF-l-positive  cells.  All  four  antibodies  were  incubated  for 
1.5  h  at  room  temperature.  Immunohistochemistry  results  were 
scored  jointly  by  two  authors  (X.  Tang  and  I.I.  Wistuba),  who  were 
blinded  to  clinical  and  other  molecular  variables.  Immunostaining  of 
the  cell  membrane  and  cytoplasm  for  EGFR  and  pEGFR  was  evalu¬ 
ated  by  light  microscopy  (magnification,  x20).  A  semiquantitative 
approach  was  used  to  generate  a  score  for  each  tissue  site,  as  pre¬ 
viously  described  (2,  23,  24).  Membrane  and  cytoplasm  stains  were 
recorded  separately.  We  defined  the  intensity  score  as  follows:  0,  no 
appreciable  staining  in  the  NBE  or  malignant  cells;  1,  barely  detect¬ 


able  staining  in  NBE  or  malignant  cells  compared  with  the  stromal 
elements;  2,  readily  appreciable  staining;  3,  dark  brown  staining  of 
cells;  and  4,  very  strong  staining  of  cells.  The  score  was  also  based 
on  the  fraction  of  cells  showing  a  given  staining  intensity  (0-100%). 
We  calculated  the  immunohistochemical  scores  by  multiplying  the  in¬ 
tensity  and  extension,  and  the  scores  ranged  from  0  to  400.  For  the 
statistical  analyses,  scores  of  0  to  200  signified  negative /low  expres¬ 
sion,  and  scores  >200  indicated  positive /overexpression,  as  pre¬ 
viously  reported  (2,  23,  24).  For  the  evaluation  of  nuclear  TITF-1 
immunohistochemical  expression,  200  epithelial  cells  were  quantified 
by  light  microscopy  (magnification,  x20),  and  a  score  (range,  0-100) 
expressing  the  percentage  of  positive  cells  was  obtained. 

Statistical  analysis 

All  relationships  between  categorical  variables  were  assessed  using 
%2  and  Fisher's  exact  tests.  P  <  0.05  values  were  considered  statistically 
significant. 


Table  3.  EGFR  mutation  and  protein  overexpression  in  histologically  normal  epithelium  by  location 


EGFR  abnormality  in  NBE 

Location  in  relation  to  the  tumor 

Structure 

Inside 

Adjacent 

Distant 

Bronchiole 

Small  bronchus 

Mutation 

Mutant  tumor 

11/31  (36%)* 

10/35  (29%) 

1/17  (6%)* 

10/43  (23%) 

12/42  (29%) 

Wild-type  tumor 

2/15  (13%) 

3/28  (11%) 

1/15  (7%) 

4/34  (12%) 

2/23  (9%) 

All  tumors 

13/46  (28%) 

13/63  (21%) 

2/32  (6%) 

14/77  (19%) 

14/65  (22%) 

EGFR  overexpression* 

Mutant  tumor 

24/29  (83%)* 

20/33  (61%)* 

8/16  (50%)* 

18/38  (47 %)§ 

34/40  (85%)® 

Wild-type  tumor 

10/15  (67%) 

17/28  (61%) 

8/13  (62%) 

14/33  (42%)® 

21/23  (91%)® 

All  tumors 

34/44  (77%) 

37/61  (61%) 

16/29  (55%) 

32/71  (45%)® 

55/63  (87%)® 

pEGFR  overexpression1" 

Mutant  tumor 

13/29  (45%) 11 

5/33  (15%)" 

6/16  (38%)" 

10/38  (26%) 

14/40  (35%) 

Wild-type  tumor 

5/15  (33%) 

5/28  (18%) 

2/13  (15%) 

2/33  (6%)® 

10/23  (44%)® 

All  tumors 

18/44  (41%) 

10/61  (16%) 

8/29  (28%) 

12/71  (17%)11 

24/63  (38%)11 

Comparison  of  NBE  from  inside  tumor  vs.  NBE  distant  (P  =  0.02). 

"^Positive  immunohistochemical  overexpression  score  >200  (range  0-400). 
Comparison  of  NBE  from  inside  tumor  vs.  NBE  adjacent  +  distant  (P  =  0.02) 
Comparison  of  NBE  from  bronchiole  vs.  small  bronchus  (P  <  0.001). 
Comparison  of  NBE  from  inside  tumor  vs.  NBE  adjacent  +  distant  (P  =  0.038). 
Comparison  of  NBE  from  bronchiole  vs.  small  bronchus  (P  =  0.006). 
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Results 

EGFR  abnormalities  in  the  early  pathogenesis  of  lung 
adenocarcinomas 

Patterns  of  EGFR  mutation  in  NBE.  We  previously  reported 
our  finding  of  mutations  in  exons  19  and  21  of  EGFR  in  at  least 
one  site  of  microdissected  NBE  obtained  from  lung  cancer 
specimens  from  9  of  21  (43%)  patients  with  EGFR- mutant 
adenocarcinomas,  with  no  such  mutations  found  in  any  of  26 
respiratory  epithelium  foci  from  16  patients  with  wild-type 
tumors  (11).  In  the  present  study,  using  the  same  methodol¬ 


ogy,  we  analyzed  for  EGFR  mutation  in  NBE  obtained  from 
an  additional  3  patients  with  an  EGFR- mutant  and  10  patients 
with  EGFR-wild-type  lung  adenocarcinomas.  Combining 
both  data  sets,  the  overall  rate  of  mutation  in  NBE  from 
EGFR- mutant  tumors  was  50%.  In  the  wild-type  tumor  cases, 
we  detected  EGFR  exon  19  deletions  (15  bp,  746-750)  in  six 
sites  of  small  bronchial  (n  =  4)  and  bronchiolar  (n  =  2)  NBE 
obtained  from  three  wild-type  tumors  (Table  2).  Thus,  an 
EGFR  mutation  was  found  in  NBE  in  3  of  26  (12%)  wild-type 
adenocarcinomas  and  in  8  of  57  (14%)  of  the  microdissected 
epithelial  sites  (Table  2). 


A 


Case  # 
1 
2 

3 

4 

5 

6 

7 

8 
9 


Primary  Tumor 


□  IlilUl 
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Cases  with  Homogeneous  Mutation  Pattern  5/9  (44%) 
Sites  with  Mutation  54  /  56  (96%) 


Metastasis 
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7  /  8  (88%) 
25  /  30  (83%) 


'  |  Exon  19  deletion  746-750  [|j|  Exon  1 9 1.747  P 
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Cases  with  Identical  Copy  Number  Pattern 

2  /  9  (22%) 

3  /  8  (38%) 

Cases  with  Only  High  Copy  Number  Category 

3  /  9  (33%) 

7/8(87%) 

Sites  with  High  Copy  Number 

33  /  42  (79%) 

25  /  29  (86%) 

No  increased  Copy  Number 
(ff)  High  Polysomy 


Gene  Amplification 


Fig.  2.  A  EGFR  mutation  pattern  in  56  primary  tumor  and  30  lymph  node  metastasis  sites  obtained  from  nine  patients  with  EGFR- mutant  lung  adenocarcinomas. 
A  homogeneous  mutation  pattern  was  detected  in  five  primary  tumors  (cases  2,  3,  4,  7,  and  9)  and  all  but  one  (case  6)  metastasis  case.  Case  6,  mixed  wild-type 
and  mutant  sites  in  both  primary  tumor  sites  and  corresponding  metastases.  B,  EGFR  copy  number  pattern  shown  by  FISH  in  42  primary  tumor  and  29  lymph 
node  metastasis  sites  obtained  from  nine  patients  with  EGFF?-mutant  lung  adenocarcinomas.  Different  FISH  copy  number  categories  (low  vs.  high)  were  found  in 
six  of  nine  primary  tumors  and  in  one  of  eight  corresponding  metastases.  Positive  EGFR  FISH  expression  included  high  polysomy  and  gene  amplification,  and 
negative  EGFR  FISH  expression  included  disomy  and  trisomy. 
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Table  4.  Summary  of  EGFR  abnormalities  by  sites 
in  nine  primary  lung  adenocarcinomas  and 
corresponding  lymph  node  metastases 


EGFR  abnormality/ 
number  of  sites 

Primary  tumor 

Metastases 

Mutation 

Number  of  sites  examined 

56 

30 

Mutation  positive 

54  (96%)* 

25  (83%)* 

Copy  no. 

Number  of  sites  examined 

42 

29 

Low  copy  no. 

9  (21  %) 

4  (14%) 

High  copy  no. 

33  (79%) 

25  (86%) 

High  polysomy 

22  (52%) 

18  (62%) 

Gene  amplification 

11  (26%) 

7  (24%) 

Protein  overexpression1" 

Number  of  sites  examined 

65 

31 

EGFR 

42  (65%)* 

30  (97%)* 

pEGFR 

9  (14%)§ 

21  (68%)® 

*The  same  case  harbored  two  primary  tumor  and  five  metastasis 
sites  with  EGEE-wild-type  sequence. 

f  Positive  immunohistochemical  expression  score  >200  (range 
0-400). 

^Primary  tumor  vs.  metastasis  (P  =  0.02). 

§Primary  tumor  vs.  metastasis  {P  =  0.00001). 


The  combined  data  showed  that  NBE  with  mutant  EGFR 
was  detected  in  the  small  bronchi  (13  of  64,  20%)  and  bronch¬ 
ioles  (17  of  78,  22%)  of  both  mutant  and  wild-type  tumor 
cases.  Overall,  however,  the  mutation  frequency  was  higher 
in  NBE  samples  microdissected  from  within  the  tumor  (13  of 
47,  28%)  than  in  samples  obtained  from  adjacent  tissue  and 
tissue  distant  from  the  tumors  (17  of  95,  18%;  Table  3). 

In  our  previously  reported  comparison  of  NBE  and  corre¬ 
sponding  tumors  (16  specimens),  we  always  observed  identi¬ 


cal  EGFR  mutations  in  both  sites  examined  (11).  In  this  study, 
we  have  expanded  the  number  of  NBE  sites  ( n  =  85)  examined 
for  the  mutation  in  patients  with  EGFR- mutant  adenocarcino¬ 
mas  and  detected  five  sites  (6%)  from  three  cases  in  which 
NBE  showed  mutations  different  from  the  ones  detected  in 
the  corresponding  tumor  specimens  (data  not  shown).  Impor¬ 
tantly,  in  all  cases  with  a  mutation  in  NBE,  an  identical  muta¬ 
tion  was  detected  in  at  least  one  site  of  the  corresponding 
tumor  specimen.  Thus,  in  this  expansion  of  our  previous 
study  (11),  a  relatively  more  heterogeneous  EGFR  mutation 
pattern  of  the  respiratory  field  was  detected  in  NBE  microdis¬ 
sected  from  mutant  lung  adenocarcinomas,  but  most  NBE  and 
corresponding  tumors  shared  the  same  mutation. 

EGFR  copy  number  and  correlation  with  gene  mutation  in 
NBE.  To  determine  the  morphologic  stage  at  which  EGFR  copy 
abnormalities  arise  in  EGFR-mutant  adenocarcinomas,  we  did 
a  precise  mapping  analysis  and  examined  EGFR  copy  number 
in  21  NBE  sites  obtained  from  nine  mutant  adenocarcinomas 
using  FISH.  All  nine  tumor  specimens  showed  at  least  one  site 
with  a  high  copy  number.  These  epithelial  sites  were  also  exam¬ 
ined  in  the  EGFR  mutation  analysis.  Most  NBE  (14  of  21,  67%) 
showed  no  EGFR  FISH  abnormalities  (disomy),  including  four 
EGFR-mutant  sites  with  exon  19  (15  bp)  deletions  and  exon  21 
(L858R)  point  mutations.  Trisomy  was  detected  in  seven  (33%) 
NBE  sites  obtained  from  six  (67%)  cases.  We  did  not  identify 
any  NBE  with  EGFR  amplifications  or  a  high  level  of  polysomy, 
which  have  been  defined  as  high  gene  copy  number.  In  con¬ 
trast,  the  nine  tumors  mapped  showed  significantly  higher  fre¬ 
quency  of  EGFR  amplification  (11  of  42  sites,  26%;  F  <  0.018)  or  a 
high  level  of  polysomy  (22  of  42, 52%;  F  <  0.001)  compared  with 
NBE.  Our  findings  indicate  that  high  EGFR  copy  number  does 
not  occur  in  peripheral  NBE  in  EGFR-mutant  lung  adenocarci¬ 
nomas  and  that  gene  mutations  precede  copy  number  abnorm¬ 
alities  in  the  sequential  pathogenesis  of  these  tumors. 

EGFR  immunohistochemical  expression  and  correlation 
with  gene  mutation  in  NBE.  We  evaluated  the  level  of  EGFR 
and  pEGFR  protein  expression  in  134  NBEs  obtained  from 
FGFR-mutant  and  wild-type  lung  adenocarcinomas.  Overall,  a 


Fig.  3.  Proposed  sequence  of  EGFR 
abnormalities  occurring  in  the  early 
pathogenesis  and  progression  of 
EGFR-mutant  lung  adenocarcinomas. 
NBE  field,  primary  tumor,  and  metastasis 
sites.  Small  circles,  NBE,  which  acquires 
EGFR  mutations  and  EGFR  protein 
(total  and  phosphorylated)  overexpression 
{gray  circles).  In  the  primary  tumor  stage, 
the  EGFR  copy  number  increases  (high 
polysomy  and  gene  amplification)  in 
small  tumor  foci  {striped  ovals).  In  the 
metastasis  site,  tumor  cells  show  both 
EGFR  mutation  and  high  copy  number 
throughout  most  of  the  lesion. 
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high  level  of  EGFR  (69%)  and  a  moderate  level  of  pEGFR  (33%) 
expression  were  detected  in  NBE  from  patients  with 
tumors  (Table  2).  However,  EGFR  and  pEGFR  were  expressed 
to  a  greater  degree  in  NBE  sites  obtained  from  patients  with 
EGFR-mutant  tumors  than  in  patients  with  wild-type  tumors 
(Table  2),  although  these  differences  were  not  statistically  signif¬ 
icant.  The  frequency  of  EGFR,  but  not  of  pEGFR,  overexpression 
was  higher  in  E GER-wild-type  NBE  sites  (85  of  111,  77%)  than  in 
mutant  sites  (14  of  24,  58%;  P  =  0.039).  Of  interest,  NBE  located 
inside  tumors  showed  the  highest  frequency  of  EGFR  and 
pEGFR  overexpression  compared  with  NBE  located  adjacent 
to  and  distant  from  tumors,  especially  in  EGFR-mutant  tumors 
(Table  3).  Small  bronchi  also  showed  a  higher  frequency  of  over¬ 
expression  of  both  markers  compared  with  bronchioles  (Table 
3).  Thus,  the  overexpression  of  EGFR  and  pEGFR  is  a  common 
event  in  NBE  from  patients  with  lung  adenocarcinomas,  espe¬ 
cially  in  EGFR-mutant  tumors,  and  shows  a  localized  field  phe¬ 
nomenon  effect  similar  to  gene  mutation. 

TITF-1  immunohisto chemical  expression  and  EGFR  muta¬ 
tion  in  NBE.  Recently,  on  the  basis  of  immunohistochemical 
findings  of  higher  levels  of  nuclear  TITF-1  expression,  a  crucial 
transcription  factor  of  the  lung,  in  EGFR-mutant  lung  adeno¬ 
carcinomas  compared  with  in  wild-type  tumors,  it  has  been 
suggested  that  EGFR-mutant  lung  adenocarcinoma  originates 
from  the  terminal  respiratory  unit  (25),  which  is  composed  of 
alveolar  cells  and  nonciliated  bronchiolar  epithelium.  Its  char¬ 
acteristics  are  highlighted  by  the  expression  of  TITF-1  (25).  We 
therefore  investigated  the  correlation  between  EGFR  mutation 
and  TITF-1  nuclear  expression  in  tumor  and  normal  epithe¬ 
lium  sites.  EGFR-mutant  lung  adenocarcinomas  (18  of  20 
cases,  90%)  showed  higher  expression  of  TITF-1  than  did 
wild-type  tumors  (10  of  26  cases,  38%;  P  <  0.001).  However, 
in  immunohistochemical  studies,  we  did  not  see  a  significant 
difference  in  the  frequency  of  TITF-1  expression  between 
EGFR-mutant  (11  of  25  sites,  44%)  and  EGFR-wild-type  (34 
of  105  sites,  33%;  P  =  0.273)  respiratory  epithelia.  Our  findings 
therefore  indicate  that  NBE  cells  expressing  TITF-1  are  not  the 
exclusive  precursors  of  EGFR-mutant  adenocarcinomas.  From 
these  results,  it  is  clear  that  these  tumors  do  not  originate  ex¬ 
clusively  from  terminal  respiratory  unit  structures. 

EGFR  abnormalities  in  the  progression  of  lung 

adenocarcinomas 

EGFR  mutation  pattern  in  primary  tumors  and  correspond¬ 
ing  metastasis.  To  identify  the  characteristics  of  EGFR 
abnormalities  in  the  progression  of  mutant  lung  adenocarci¬ 
nomas,  we  examined  EGFR  gene  mutation,  gene  copy  num¬ 
ber,  and  protein  expression  in  primary  tumors  and 
corresponding  metastases  by  performing  a  detailed  mapping 
analysis  of  tumor  specimens.  For  this  study,  we  selected  nine 
lung  adenocarcinomas  with  known  EGFR  mutations  in  exon 
19  ( n  =  5)  and  exon  21  ( n  =  4),  and  with  lymph  node 
metastases  for  which  there  was  sufficient  tissue  to  perform 
our  mapping  analysis. 

For  the  mutation  analysis  of  EGFR  exons  19  and  21,  we  did 
precise  tissue  microdissection  from  noncontiguous  primary 
tumor  foci  (n  =  56  sites,  6.2  sites/tumor;  range  2-11  sites)  con¬ 
taining  at  least  1,000  cells.  Surprisingly,  four  of  the  nine  pri¬ 
mary  tumorsexamined  showed  mixed  EGFR  gene  patterns 
(Fig.  2A):  three  showed  two  or  more  types  of  mutations, 
and  one  showed  five  sites  with  exon  19  (15  bp,  746-750)  dele¬ 


tion  and  two  sites  with  the  wild-type  EGFR  gene.  EGFR  muta¬ 
tion  analysis  of  30  corresponding  lymph  node  metastasis  sites 
from  the  nine  EGFR-mutant  cases  (3.3  sites/case;  range  1-6 
sites)  detected  only  one  type  of  EGFR  mutation  in  all  tumor 
sites  in  each  case,  and  the  mutation  was  always  present  in 
at  least  one  site  of  the  corresponding  primary  tumor.  Similar 
to  the  corresponding  primary  tumor,  one  metastasis  case 
showed  EGFR-wild-type  (five  sites)  and  EGFR-mutant  [one 
site,  exon  19  (15  bp,  746-750)  deletion]  tumor  sites  (Fig.  2A). 
All  these  findings  were  confirmed  by  sequencing  analyses  of 
independently  microdissected  samples.  In  summary,  our  find¬ 
ings  showed  a  relatively  high  level  of  heterogeneity  for  the 
EGFR  mutation,  and  several  tumor  cell  clones  had  mutation 
patterns  in  the  primary  tumor  specimens  that  differed  from 
the  mutation  patterns  in  the  lymph  node  metastasis  sites. 

EGFR  copy  number  abnormalities  in  primary  tumors  and 
corresponding  metastasis.  We  used  FISH  to  investigate  the 
EGFR  gene  copy  number  abnormalities  in  42  primary  tumor 
sites  (2.1  sites/case;  range  2-7  sites)  and  29  metastasis  sites 
(3.2sites/case;  range  1-6  sites),  which  were  also  examined  for 
the  mutation  analysis.  Overall,  all  primary  tumors  and  corre¬ 
sponding  metastases  showed  at  least  one  site  of  high  gene  copy 
number  (high  polysomy  or  gene  amplification;  Fig.  2B).  How¬ 
ever,  six  (67%)  primary  tumor  cases  and  one  (11%)  metastasis 
case  showed  at  least  one  site  without  high  copy  number  (disomy 
in  one  primary  tumor  site,  high  trisomy  in  one  metastasis  site, 
and  low  polysomy  in  seven  primary  and  three  metastasis  sites; 
Fig.  2B).  Thus,  EGFR  copy  number  heterogeneity  was  higher  in 
primary  tumor  sites  than  in  corresponding  metastasis  sites. 

EGFR  immunohistochemical  expression  in  primary  tumors 
and  corresponding  metastasis  sites.  In  the  nine  EGFR-mutant 
lung  adenocarcinoma  cases  mapped  for  EGFR  abnormalities, 
we  examined  both  primary  tumors  and  the  corresponding 
lymph  node  metastases  for  EGFR  and  pEGFR  immunohisto¬ 
chemical  expression.  For  both  tumor  locations  combined,  96 
distinct  tumor  sites  were  examined  (n  =  65  primary  tumor  sites, 
7.2  sites/case;  and  n-  31  metastasis  sites,  3.4  sites/case).  Sig¬ 
nificantly  higher  levels  of  EGFR  and  pEGFR  expression  were 
detected  in  metastasis  sites  compared  with  primary  tumor  sites 
(Table  4).  No  correlation  between  EGFR  and  pEGFR  expression 
and  EGFR  copy  number  status  by  FISH  was  detected. 

Discussion 

Using  a  detailed  molecular  pathology  mapping  strategy,  we 
determined  the  sequence  of  EGFR  abnormalities  in  the  early 
pathogenesis  of  EGFR-mutant  lung  adenocarcinomas  and  iden¬ 
tified  the  pattern  of  EGFR  changes  in  the  progression  of  EGFR- 
mutant  lung  adenocarcinomas  from  primary  tumors  to  lymph 
node  metastasis.  First,  we  showed  that  EGFR  mutations  pre¬ 
cede  gene  copy  number  abnormalities  in  the  pathogenesis  of 
these  tumors  and  that  EGFR  and  pEGFR  immunohistochemical 
protein  expressions  are  frequent  events  in  histologically  normal 
peripheral  bronchial  and  bronchiolar  epithelium  adjacent  to 
lung  adenocarcinomas.  Second,  our  data  indicated  that 
although  primary  lung  adenocarcinomas  show  some  degree 
of  EGFR  gene  copy  number  heterogeneity,  this  phenomenon 
is  rare  in  metastases.  Although  these  findings  can  be  considered 
tumor  progression  phenomena,  they  also  have  important  clin¬ 
ical  implications  from  the  standpoint  of  making  decisions  re¬ 
garding  the  use  of  EGFR  TK  inhibitor  therapy  on  the  basis  of 
the  finding  of  EGFR  gene  abnormalities. 
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Despite  the  evidence  showing  that  atypical  adenomatous 
hyperplasia  is  a  precursor  of  peripheral  lung  adenocarcinomas 
(26),  there  is  consensus  that  the  pathogenesis  of  most  adeno¬ 
carcinomas  is  unknown.  Our  previously  reported  findings  of 
an  EGFR  mutation  in  NBE  in  9  of  21  (43%)  patients  with 
EGFR-mutant  adenocarcinomas  indicated  that  the  EGFR  gene 
mutation  is  an  early  event  in  the  pathogenesis  of  lung  adeno¬ 
carcinoma  (11).  In  this  study,  we  have  investigated  normal 
epithelium  from  additional  patients  with  EGFR-mutant  or 
wild-type  lung  adenocarcinomas  and  specifically  have  two 
new  findings  in  this  study;  (a)  we  detected  an  EGFR  mutation 
(exon  19, 15  bp  deletion,  746-751)  in  six  sites  of  small  bronchial 
and  bronchiolar  epithelium  obtained  from  three  patients  with 
wild-type  adenocarcinoma,  and  ( b )  whereas  an  identical  mu¬ 
tation  was  detected  in  the  majority  of  specimens  of  mutant 
normal  epithelium  compared  with  the  corresponding  invasive 
tumor  (75%  of  cases  and  94%  of  sites),  we  found  few  normal 
epithelium  sites  (6%)  in  three  of  12  cases  (25%)  of  EGFR- 
mutant  tumors,  demonstrating  the  existence  of  a  different  mu¬ 
tation  pattern  between  normal  epithelium  and  the  correspond¬ 
ing  invasive  tumor.  All  these  data  reinforce  the  concept  of  a 
field  effect  phenomenon  in  EGFR  mutations  in  lung  adenocar¬ 
cinoma  pathogenesis  that  affects  histologically  normal  bron¬ 
chial  and  bronchiolar  respiratory  epithelia. 

We  have  previously  shown  that  molecular  abnormalities 
occur  in  a  stepwise  fashion  in  the  sequential  pathogenesis  of 
squamous  cell  carcinoma  of  the  lung,  with  molecular  changes 
commencing  in  histologically  normal  bronchial  epithelium  in 
smokers  and  in  patients  with  lung  cancer  (27,  28).  Our  find¬ 
ings  suggest  that  EGFR  abnormalities  also  occur  sequentially 
in  the  early  pathogenesis  of  lung  adenocarcinoma,  with  a  mu¬ 
tation  commencing  in  histologically  normal  epithelium  and  a 
high  EGFR  copy  number  appearing  at  the  invasive  tumor 
stage.  A  recent  report  (29)  of  selective  gene  amplification  of 
the  shorter  allele  of  the  EGFR  intron  1  polymorphism  CA  sim¬ 
ple  sequence  repeat  1,  which  is  the  allele  more  frequently  mu¬ 
tated  in  tumors  harboring  an  EGFR  mutation,  also  suggests 
that  mutations  occur  earlier  than  copy  number  abnormalities 
in  the  pathogenesis  of  lung  adenocarcinoma.  Our  findings  of 
frequent  EGFR  (69%)  and  pEGFR  (33%)  protein  overexpres¬ 
sion  in  normal  distal  bronchial  and  bronchiolar  epithelium 
from  patients  with  either  EGFR-mutant  or  EGFR-wild-type 
lung  adenocarcinomas  indicate  a  field  phenomenon  in  the 
peripheral  airway.  A  relatively  high  frequency  of  EGFR  pro¬ 
tein  expression  has  also  been  reported  in  centrally  located, 
histologically  normal  (42%)  and  hyperplastic  (54%)  bronchial 
epithelium  from  smokers  (23).  In  addition,  our  data  indicate 
that  the  mechanisms  of  protein  overexpression  seem  to  be  un¬ 
associated  with  high  gene  copy  number  and  mutation  in  NBE. 
Other  mechanisms  can  explain  EGFR  overexpression  in  nor¬ 
mal  epithelial  cells,  including  ligand-dependent  up-regulation 
and  activation,  as  well  as  inhibition  of  endocytosis-related 
protein  down-regulation  in  the  cell  membrane  (30). 

Based  on  findings  of  higher  levels  of  immunohistochem- 
ical  expression  of  nuclear  TITF-1,  a  crucial  transcription  fac¬ 
tor  of  the  lung,  in  EGFR-mutant  lung  adenocarcinomas 
compared  with  wild-type  tumors,  it  has  been  suggested  that 
EGFR-mutant  lung  adenocarcinoma  originates  from  the 
terminal  respiratory  unit  (25).  We  found  EGFR  mutations 
in  microdissected  histologically  normal  epithelial  cells  from 
small  bronchi  and  bronchioles,  which  supports  the  concept 


of  adenocarcinomas  arising  from  the  peripheral  lung  airway. 
Our  findings  indicate  that  NBE  cells  expressing  TITF-1  are 
not  the  exclusive  precursors  of  EGFR-mutant  adenocarcino¬ 
mas.  From  this  finding,  it  is  clear  that  these  tumors  do  not 
originate  exclusively  from  terminal  respiratory  unit  struc¬ 
tures.  In  addition,  we  cannot  exclude  the  possibility  that 
common  stem  or  progenitor  cells  for  both  bronchial  and 
bronchiolar  epithelium  bear  EGFR  mutations. 

It  has  been  suggested  that  activating  TK  EGFR  mutations 
are  a  potent  oncogenic  event  by  which  mutant  tumor  cells 
become  physiologically  dependent  on  the  continued  activity 
of  the  phosphorylated  protein  for  the  maintenance  of  their 
malignant  phenotype  (31).  Our  detailed  mapping  analysis 
of  the  EGFR  gene  mutation  and  copy  number  of  multiple 
precisely  microdissected  sites  in  nine  mutant  primary  tu¬ 
mors  and  corresponding  lymph  node  metastases  showed 
an  identical  or  monoclonal  pattern  of  mutation  in  most 
(n  =  5)  primary  tumors  and  all  metastases.  These  findings 
corroborate  the  monoclonal  concept  of  tumor  development 
and  the  monoclonal  evolution  of  metastases  (32,  33).  How¬ 
ever,  two  primary  tumors  lacking  identical  or  monoclonal 
EGFR-mutant  patterns  harbored  different  sizes  of  exon  19 
deletions  (12  versus  15  bp  and  15  versus  18  bp  deletions). 
This  finding  could  be  explained  by  a  tumor  progression  phe¬ 
nomenon  in  which  the  deletion  size  changed  during  the  evo¬ 
lution  of  the  malignancy.  However,  two  very  interesting 
primary  tumors  in  our  study  exhibited  findings  that  chal¬ 
lenged  the  concept  of  the  monoclonal  evolution  of  tumors. 
One  case  showed  a  single  site  with  an  exon  19  (15  bp)  dele¬ 
tion,  whereas  the  remaining  eight  sites  lacked  the  deletion  but 
showed  a  point  mutation  (TTA747CCA)  in  the  same  exon.  Of 
interest,  the  three  metastasis  sites  examined  harbored  the 
most  frequent  mutation  detected  in  the  primary  lung  tumor. 
The  other  case  showed  areas  of  wild-type  and  mutant  EGFR 
in  both  primary  tumors  and  metastases,  a  phenomenon  that 
is  difficult  to  explain  and  suggests  that  molecular  events 
other  than  an  EGFR  mutation  may  be  responsible  for  tumor 
development  in  lung  adenocarcinomas.  These  findings  were 
confirmed  by  the  sequencing  of  independently  microdis¬ 
sected  samples.  In  the  latter  case,  the  finding  of  a  high  EGFR 
copy  number  (high  polysomy)  in  wild-type  tumor  sites  raises 
the  possibility  of  an  alternative  explanation — that  the  wild- 
type  allele  is  preferentially  amplified  in  some  tumor  cells. 
As  a  result,  the  mutant  allele  is  underrepresented  and  is  not 
detectable  by  our  current  sequencing  methodology. 

Retrospective  studies  have  provided  data  suggesting  that 
a  high  EGFR  gene  copy  number  shown  by  FISH  is  associated 
with  treatment  response,  time  to  progression,  and  survival  in 
patients  with  advanced  non-small  cell  lung  carcinoma  trea¬ 
ted  with  EGFR  TK  inhibitors  (5-7,  10,  17).  In  these  studies, 
high  EGFR  copy  number  as  shown  by  FISH  was  defined 
as  true  gene  amplification  or  high  polysomy  with  equal  to 
or  more  than  four  EGFR  copies  in  >40%  of  cells  (5,  34). 
Our  mapping  analysis  of  primary  tumors  and  corresponding 
lymph  node  metastases  in  which  we  used  the  same  EGFR 
FISH  criteria  showed  that  a  frequent  high  copy  number  in 
mutant  tumors  was  the  most  frequent  pattern  detected.  De¬ 
spite  the  fact  that  most  primary  tumor  sites  and  nearly  all 
metastasis  sites  showed  high  copy  numbers,  high  polysomy 
and  gene  amplification  were  heterogeneously  distributed  in 
both  tumor  locations.  More  importantly,  five  of  nine  (56%) 
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primary  tumors  and  one  metastasis  (13%)  showed  one  or 
more  sites  without  an  increased  copy  number  (FISH  nega¬ 
tive).  Similarly,  EGFR  and  pEGFR  immunohistochemical  ex¬ 
pression  was  less  heterogeneous  in  primary  tumors  and 
more  frequent  in  metastases.  Taken  together,  these  data  sug¬ 
gest  that  EGFR  copy  number  analyzed  by  FISH  and  protein 
expression  analyzed  by  immunohistochemistry  in  small  core 
biopsy  or  fine-needle  aspiration  specimens  obtained  from 
primary  tumors,  and  more  rarely  from  metastases,  could 
miss  these  molecular  changes,  especially  if  only  a  small  num¬ 
ber  of  malignant  cells  are  available  for  examination.  In  addi¬ 
tion,  if  the  suggested  presence  of  EGFR  high  copy  number 
correlates  with  sensitivity  to  EGFR  TK  inhibitors  (5-7,  17), 
it  is  likely  that  metastases  will  show  a  better  response  to 
therapy  than  will  primary  tumors.  This  is  an  important  con¬ 
sideration,  in  light  of  the  fact  that  the  site  of  origin  (primary 
versus  metastasis)  of  the  tumor  specimen  was  not  reported 
and  factored  into  the  biomarker  analyses  in  any  of  the  clin¬ 
ical  trials  testing  the  efficacy  of  EGFR  TK  inhibitors  in  pa¬ 


tients  with  advanced  non-small  cell  lung  carcinoma  in 
whom  EGFR  copy  number  determined  by  FISH  was  exam¬ 
ined  as  a  predictor  of  response  and  prognosis  (5-7).  Our  re¬ 
sults  show  that  a  better  understanding  of  the  pattern  of 
molecular  abnormalities  and  their  corresponding  biomarker 
expression,  including  primary  tumors  and  the  frequent  me¬ 
tastases  seen  for  this  tumor  type,  is  important  in  lung  cancer. 

In  summary,  our  data  suggest  that  gene  mutations  and 
protein  overexpression  are  the  earliest  phenomena  in  EGFR- 
mutant  lung  adenocarcinoma,  occurring  at  the  NBE  stage, 
and  that  this  is  followed  by  the  development  of  a  focal  in¬ 
crease  in  copy  number  at  the  tumor  stage  (Fig.  3).  At  the  me¬ 
tastasis  sites,  however,  all  three  abnormalities  were  more 
frequent  than  they  were  in  the  primary  tumors  and  were 
homogeneously  distributed  throughout  the  malignant  cells. 
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Chromosomal  Deletions  and  Progression  of  Premalignant  Lesions: 
Less  Is  More 


Ignacio  I.  Wistuba1  and  Matthew  Meyerson2,3 

Epithelial  malignancies  arise  after  a  series  of  progressive 
pathologic  changes  including  hyperplasia,  different  grades 
of  dysplasia,  and  carcinoma  in  situ  (1).  These  premalignant 
changes  can  accompany  cancer  or  may  occur  in  the  epithelium 
of  individuals  at  high  risk.  Sequential  premalignant  lesions 
have  been  defined  for  many  epithelial  tumors,  including  oral 
squamous  cell  carcinoma  (1)  and  Barrett's  esophageal  adeno¬ 
carcinoma  (3).  Two  of  the  most  exciting  areas  of  current  cancer 
research  are  (a)  novel  high-throughput  technology  for  molecu¬ 
lar  studies  of  carcinogenesis  and  (b)  the  use  of  this  technology 
in  discovering  and  characterizing  genetic  abnormalities  that 
underlie  the  progression  of  epithelial  premalignancy 

Many  studies  of  the  last  2  decades  have  helped  to  character¬ 
ize  molecular  changes  of  epithelial  premalignancy.  Encour¬ 
aged  by  the  development  of  methodologies  (such  as  laser 
microdissection)  for  isolating  cells  from  small  histologic  le¬ 
sions,  and  of  techniques  to  do  genomic  studies  on  minute 
amounts  of  DNA,  several  groups  have  made  substantial  pro¬ 
gress  in  unveiling  the  molecular  and  genetic  abnormalities  of 
epithelial  premalignant  lesions  (2,  3-7).  These  changes  involve 
inactivation  of  known  and  putative  tumor  suppressor  genes 
and  several  dominant  oncogenes.  Tumor  suppressor  genes 
are  believed  to  be  inactivated  via  a  two-step  process  involving 
both  alleles.  Knudson  (8)  proposed  that  the  first  "hit"  fre¬ 
quently  is  a  point  mutation,  whereas  the  second  allele  is  sub¬ 
sequently  inactivated  via  a  chromosomal  deletion,  translocation, 
or  other  event  such  as  methylation  of  gene  promoter  regions  (9). 
Dominant  oncogenes  are  frequently  activated  by  mutation, 
increased  copy  number,  and  translocations  (9). 

The  general  working  model  of  sequential  molecular  ab¬ 
normalities  in  the  pathogenesis  of  epithelial  tumors  indicates 
that  genetic  changes  ( a )  commence  in  histologically  normal 
epithelium  and  progress  with  the  increasing  severity  of  epithe¬ 
lial  changes,  (b)  follow  a  sequence  from  early  to  late  changes, 
and  ( c )  are  extensive  and  multifocal  throughout  the  epithe¬ 
lium,  indicating  a  field  effect,  also  known  as  "field  canceriza- 
tion"  (3).  Therefore,  in  various  organ  sites,  multiple  clonal  and 
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subclonal  patches  of  molecularly  abnormal  epithelial  cells 
(with  or  without  cytologic  and  morphologic  abnormalities) 
can  be  detected  throughout  the  affected  epithelium.  New 
high-throughput  genomic  and  proteomic  profiling  techniques 
are  now  starting  to  be  applied  to  premalignant  or  normal¬ 
appearing  tissue  because  these  techniques  are  suitable  for 
studying  the  small  amounts  of  tissue  usually  available  in  these 
precancer  settings. 

Oral  squamous  cell  carcinoma  and  Barrett's  esophageal  ade¬ 
nocarcinoma  are  on  the  frontier  of  innovative  discoveries  in¬ 
volving  molecular  events  in  the  progression  of  premalignant 
lesions  (2,  3).  These  two  diseases  are  good  models  of  cancer 
genetic  progression  because  the  premalignant  epithelium  can 
be  safely  visualized  and  biopsied  so  that  genomic  changes  can 
be  compared  in  different  stages  of  neoplastic  evolution  and 
then  studied  longitudinally  by  biopsy  surveillance.  Several 
studies  have  shown  that  the  general  working  model  of  pro¬ 
gression  described  above  applies  to  both  tumor  types. 

Oral  leukoplakia  is  the  most  common  head  and  neck  pre¬ 
malignant  lesion  and  has  a  malignant  transformation  rate  as 
high  as  24%  (10).  Deletion  of  one  of  the  two  alleles  within 
chromosome  arm  3p  and  chromosome  segment  9p21  is  the 
most  frequent  event  in  oral  lesions  with  only  mild  dysplasia 
and  even  in  some  normal-appearing  epithelial  cells  (11).  Oral 
leukoplakias  with  deletions  in  the  3pl4  ( FHIT )  and  9p21 
(CDKN2A)  regions  are  reported  to  carry  a  higher  risk  for 
transformation  (2).  Barrett's  esophagus  (BE)  is  the  only  known 
premalignancy  of  esophageal  adenocarcinoma;  only  a  small 
fraction  (0.5-1%  annually)  of  BE  patients  subsequently  devel¬ 
op  adenocarcinoma  (12).  Although  less  accessible  than  are  oral 
lesions,  BE  is  a  unique  model  for  the  study  of  human  neoplas¬ 
tic  progression  in  vivo.  The  standard  care  of  BE  requires  biop¬ 
sies  according  to  defined  protocols  at  multiple  time  points 
from  the  same  patient,  allowing  the  generation  of  spatial  maps 
and  longitudinal  evaluation  of  genetic  alterations  that  arise 
during  clonal  evolution.  Several  studies  of  BE  indicate  that 
inactivation  of  CDKN2A  by  loss  of  heterozygosity  (LOH), 
methylation,  and/or  mutation  is  selected  as  an  early  event 
that  predisposes  to  large  clonal  expansions  of  the  BE  tissue 
(12,  13).  Subsequent  inactivation  of  TP53  by  mutation  and 
LOH  predisposes  to  progression  to  aneuploidy  and  invasive 
adenocarcinoma  development  (13).  All  these  studies  have 
been  done  using  low-throughput  methodologies  with  analysis 
of  abnormalities  on  few  chromosomal  foci  or  specific  genes. 

In  the  present  issue  of  the  journal,  Tsui  et  al.  (14)  and  Li  et  al. 
(15)  report  their  high-throughput  analyses  of  genetic  abnor¬ 
malities  in  premalignant  lesions  that  provide  insights  on  chro¬ 
mosomal  changes  in  the  early  pathogenesis  of  oral  squamous 
cell  carcinoma  (14)  and  BE  (15).  These  studies  are  among  the 
first  to  use  high-throughput  DNA  copy  number  analysis  by 
microarrays  in  the  study  of  the  sequential  progression  of  pre¬ 
malignant  lesions  of  any  organ  site.  Comparative  genomic 
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hybridization  (CGH)  profiling  has  been  previously  applied  to 
oral  squamous  cell  carcinoma  (16-18),  leading  to  the  identifi¬ 
cation  of  novel  chromosomal  regions  (e.g.,  5pl5.2  and 
llq.22.2-22.3)  frequently  altered  in  this  neoplasm  (16).  Earlier 
studies  using  GCH  and  single-nucleotide  polymorphism 
(SNP)  array  have  already  shown  that  chromosomal  abnormal¬ 
ities  increase  with  the  progression  of  BE  lesions,  but  they  ana¬ 
lyzed  only  a  few  tissue  specimens  (19,  20).  Both  of  the  present 
studies  showed  that  microarray-based  DNA  copy  number 
analyses  can  reveal  the  progression  of  chromosomal  abnorm¬ 
alities  that  parallels  the  clinical  and  histopathologic  progres¬ 
sion  of  premalignancy.  The  findings  of  these  studies  also 
have  some  potential  to  be  used  to  predict  the  risk  of  progres¬ 
sion  of  either  oral  leukoplakia  or  BE. 

Tsui  et  al.  (14)  used  CGH  to  evaluate  genetic  alterations  on 
chromosome  3p  in  47  oral  premalignant  lesions  with  high- 
grade  dysplastic  features  (severe  dysplasia  and  carcinoma 
in  situ)  and  compared  these  alterations  with  findings  in  23  oral 
squamous  cell  carcinomas;  all  samples  were  formalin-fixed 
and  paraffin-embedded  tissues.  This  study  was  stimulated 
by  previous  laboratory  work  of  this  group  (21,  22)  and  others 
(2,  10,  11),  showing  that  chromosome  3p  is  frequently  altered 
in  the  early  pathogenesis  of  oral  carcinoma  and  has  been 
associated  with  the  risk  of  premalignant  lesion  progression 
(2,  10,  11,  21,  22).  High-grade  dysplastic  lesions  exhibited  six 
recurrent  regions  of  losses  on  3p  including  3p25.3-p26.1, 
3p25.1-p25.3,  3p24.1,  3p21.31-p22.3,  3pl4.2,  and  3pl4.1,  which 
overlapped  losses  found  in  invasive  carcinoma.  Next,  the 
authors  examined  these  regions  in  24  low-grade  dysplastic  le¬ 
sions  with  known  outcomes,  including  2  hyperplasias  and  22 
mild  and  moderate  dysplasias,  and  determined  that  3p  losses 
were  identified  more  frequently  in  low-grade  dysplastic  le¬ 
sions  with  progressive  behavior  (78%)  compared  with  nonpro¬ 
gressing  lesions  (20%).  This  interesting  observation  obviously 
will  require  verification  in  a  larger  cohort  of  patients.  In  our 
opinion,  one  remarkable  finding  of  this  report  is  that  the  size 
of  3p  segmental  losses,  or  discontinuous  LOH  (alternating 
segments  showing  loss  and  retention;  portrayed  in  Fig.  1), 
increased  with  histologic  stage;  segmental  3p  losses  were 
detected  in  premalignant  lesions,  whereas  whole-arm  loss 
occurred  mainly  in  invasive  tumors. 

Li  et  al.  (15)  used  a  medium-density  SNP  array  (containing 
~33,000  SNPs)  to  investigate  genome-wide  chromosomal 
copy  number  changes  in  whole  frozen  tissue  specimens  (ob¬ 
tained  by  endoscopy  or  surgical  resection)  of  multiple  stages 
of  BE  and  esophageal  adenocarcinoma  from  42  patients.  This 
investigation  is  from  the  same  laboratory  that  established  the 
working  model  for  BE  molecular  progression  (3)  and  one  of 
the  first  laboratories  to  use  SNP  arrays  for  the  analysis  of  hu¬ 
man  premalignancy  (23).  The  mucosal  esophageal  tissues  used 
in  this  cross-sectional  study  included  24  early-stage  BE  speci¬ 
mens  (with  or  without  aneuploidy  and  that  did  not  develop 
cancer  during  follow-up),  10  late-stage  BE  specimens  (with 
microscopic  invasion),  and  8  grossly  invasive  esophageal  ade¬ 
nocarcinoma  specimens.  Genome-wide  copy  losses  and  LOH 
(the  most  frequent  changes)  and  copy  gains  increased  in  fre¬ 
quency  and  size  between  early  and  late  BE,  with  SNP  abnorm¬ 
alities  increasing  from  2%  in  early  stages  to  >30%  in  late 
stages.  A  set  of  statistically  significant  events  was  unique  to 
either  early  or  late  BE  stage,  and  few  chromosomal  regions 
with  changes  were  common  in  all  stages  of  progression.  It  is 


interesting  that  the  total  number  of  genome-wide  SNP  altera¬ 
tions  (gains,  losses,  and  LOH)  was  highly  correlated  with 
DNA  content  aneuploidy  and  was  sensitive  and  specific  for 
identifying  patients  with  concurrent  esophageal  adenocarcino¬ 
ma.  As  with  chromosome  3p  in  the  study  of  Tsui  et  al.  (14),  Li 
et  al.  found  that  the  sizes  of  chromosome  abnormalities  were 
small  in  early-stage  BE  compared  with  late  stages,  including 
invasive  adenocarcinoma,  except  in  the  case  of  9p  LOH. 

How  do  the  articles  by  Tsui  et  al.  (14)  and  Li  et  al.  (15)  con¬ 
tribute  to  the  general  concepts  of  the  parallel  progressions  of 
genetic  and  clinical/histopathologic  changes  in  prema¬ 
lignancy  (described  above)?  First,  these  investigators  have 
successfully  applied  high-throughput  DNA  chromosomal  ab¬ 
normality  analysis  technologies,  array  CGH  and  SNP  array 
analysis,  to  small  specimens  of  premalignant  lesion  tissue,  for¬ 
malin-fixed  and  paraffin-embedded  specimens  (CGH),  and 
frozen  samples  (SNP).  We  hope  that  these  two  reports  will 
raise  enthusiasm  for  similar  new  studies  in  these  and  other 
neoplasm  models. 

Second,  these  studies  show  that  chromosomal  abnormalities 
(mostly  deletions)  at  the  3p  chromosome  (14)  and  genome¬ 
wide  (15)  levels  commence  early  in  oral  and  BE  premalignan¬ 
cies  and  progress  with  the  increasing  severity  of  epithelial 
changes  and  that  these  chromosomal  changes  follow  a  se¬ 
quence  defining  early  and  late  molecular  changes.  Third,  these 
two  studies  have  suggested  that  the  size  of  chromosomal  de¬ 
letions  (at  3p  or  genome-wide)  increases  with  the  severity  of 
histopathologic  changes.  This  interesting  finding  has  been  re¬ 
ported  previously  in  the  pathogenesis  of  other  epithelial  tu¬ 
mors,  such  as  lung  tumors,  where  it  has  been  observed  in 
LOH  studies  using  PCR-based  amplification  of  multiple  mi¬ 
crosatellites  in  precisely  microdissected  histologically  normal 
and  premalignant  epithelia  obtained  from  lung  cancer  patients 
and  high-risk  individuals  (smokers;  refs.  24-26).  LOH  at  mul¬ 
tiple  chromosome  3p  (24,  26)  and  8p  (25)  sites  was  shown  to 
commence  at  the  stage  of  normal  epithelium  and  to  increase 
with  progressive  histologic  changes  in  the  lung  squamous  cell 
carcinoma  histology  progression  model  (similar  in  ways  to 
oral  squamous  carcinoma).  This  study  used  24  microsatellite 
markers  spanning  six  continuous  chromosome  3p  regions  in 
showing  that  deletions  in  3p  progressed  from  dysplasia  to  in¬ 
vasive  squamous  carcinoma  (Fig.  1).  In  invasive  tumors  and 
carcinoma  in  situ ,  most  of  the  3p  arm  was  deleted  and  the  ex¬ 
tent  of  the  deletions  was  greater  in  all  cases  than  that  in  cor¬ 
responding  normal  and  premalignant  foci  (24).  These  findings 
were  subsequently  expanded  in  a  LOH  analysis  (using  54  mi¬ 
crosatellite  markers  on  3p)  that  included  samples  from  a  wider 
spectrum  of  premalignancy  and  lung  cancer  specimens,  in¬ 
cluding  samples  from  smokers  and  cancers  with  different 
histologies  (nonsmall  and  small  cell).  This  detailed  allelotyp- 
ing  analysis  identified  multiple  areas  of  discontinuous  LOH 
and  thus  multiple  breakpoints  throughout  the  3p  arm  in  many 
tumor  and  bronchial  epithelial  samples  (26).  Allelic  losses  pre¬ 
sent  in  lung  premalignant  lesions  were  not  random  and  fol¬ 
lowed  a  sequence,  with  the  earliest  and  most  frequent  allelic 
loss  occurring  at  the  3p21.3  region  (24,  26). 

Although  the  two  articles  in  this  issue  of  the  journal  report 
that  discontinuous  LOH  marks  progression  of  premalignancy, 
this  LOH  also  could  be  an  artifact  of  impurity  within  the  pre¬ 
malignant  lesions.  In  samples  with  an  admixture  of  DNA  from 
malignant  and  normal  cells,  some  chromosomal  segments 
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might  seem  to  be  altered  and  others  not  altered,  although  the 
degree  of  alteration  could  be  the  same  across  the  entire  chro¬ 
mosome  arm  because  of  nonuniform  behavior  of  copy  number 
probes,  which  fail  to  reach  a  threshold  level  for  detecting  an 
alteration.  This  failure  has  been  shown  for  SNP  array  analysis 
of  mixtures  of  tumor  cell  line  DNA  with  matched  normal 
DNA;  increasing  admixture  of  normal  DNA  led  to  the  ap¬ 
pearance  of  discontinuous  LOH  because  of  incomplete  detec¬ 
tion  involving  regions  of  uninterrupted  chromosome  3p  LOH 
in  the  pure  tumor  (27).  Further  improvements  in  genomics 
technologies  coupled  with  careful  histologic  analysis  will  be 
required  to  address  this  question. 

The  mechanism  for  chromosomal  deletions  in  individual  pre- 
malignant  samples  is  not  well  established.  LOH  is  considered  to 
occur  through  the  loss  of  a  whole  chromosome  because  of  inap¬ 
propriate  chromosomal  segregation  at  mitosis  and  also  through 


genetic  alterations  that  change  chromosomal  structures  (28). 
Whole,  terminal,  and  interstitial  chromosome  physical  dele¬ 
tions  have  been  shown  to  cause  LOH  on  several  chromosomal 
arms  in  human  tumors  (29).  On  the  other  hand,  mitotic  recom¬ 
bination  and  gene  conversion  seem  to  be  additional  mechan¬ 
isms  causing  LOH  (30,  31).  Unbalanced  translocations,  which 
have  been  confirmed  by  cytogenetic  studies  in  certain  human 
tumors,  have  been  implicated  in  LOH  development  (32).  In 
summary,  six  chromosomal  aberrations  (whole  chromosome, 
terminal  and  interstitial  deletions,  mitotic  recombination,  gene 
conversion,  and  unbalanced  translocation)  are  considered  to 
be  responsible  for  LOH  in  human  carcinogenesis  (Fig.  2). 
However,  the  contribution  of  each  chromosome  alteration  to 
the  occurrence  of  LOH  has  been  examined  only  for  a  few  chro¬ 
mosomal  regions  in  few  tumor  types.  It  was  determined  re¬ 
cently  that  80%  of  LOHs  are  partial  chromosome  deletions 
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Fig.  1.  Progression  of  premalignancy  and  molecular  changes  in  Barrett's  esophageal  adenocarcinoma  [top)  and  oral  and  lung  squamous  cell  carcinomas  (SSC; 
bottom).  Chromosomal  deletions  at  genome-wide  and  the  3p  chromosome  levels  begin  early  in  esophageal  premalignancy  (or  BE)  and,  in  oral  premalignancy, 
progress  and  increase  in  size  with  increasing  histopathologic  severity  (14,  15).  Li  et  al.  (15)  showed  that  the  number  of  larger  copy  losses  (Mb)  was  significantly  higher 
in  advanced  BE  and  esophageal  adenocarcinoma  compared  with  early  BE  lesions  [top).  In  oral  premalignancy  progression,  Tsui  et  al.  (14)  showed  that  the  size 
of  3p  segmental  losses  increased  with  histologic  stage,  as  has  been  shown  previously  in  progression  to  lung  squamous  cell  carcinoma  ( bottom ;  refs.  24-26).  In  the 
lung  squamous  cell  carcinoma  model,  discontinuous  segmental  losses,  or  LOH,  at  3p  are  detected  in  normal  epithelium,  hyperplasia,  and  some  dysplasias, 
whereas  the  whole  arm  is  lost  in  invasive  and  in  situ  carcinomas  and  in  a  subset  of  dysplastic  lesions;  the  vertical  colored  bars  [bottom  right)  indicate  the  size  of 
deletions;  the  gaps,  or  retentions,  between  the  bars  indicate  discontinuous  LOH.  Histology  pictures  of  BE  progression  are  courtesy  of  Elizabeth  Montgomery,  M.D., 
and  of  oral  leukoplakia  progression  are  courtesy  of  Adel  El-Naggar,  M.D.,  Ph.D. 
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Fig.  2.  Chromosomal  alterations  considered  to  be  responsible  for  LOH  in  human  carcinogenesis.  The  whole-chromosome,  terminal,  and  interstitial  deletions  and 
unbalanced  translocation  represent  copy  number  changes.  The  red  arrows  indicate  sites  of  chromosomal  breaks. 


(involving  the  several  chromosomal  alterations  mentioned 
above),  whereas  the  remaining  20%  were  whole-chromosome 
deletions  (33);  these  results  came  from  integrating  information 
on  breakpoints  for  DNA  copy  number  changes  (obtained  by 
a  CGH)  with  numerical  and  structural  chromosomal  altera¬ 
tions  (obtained  by  spectral  karyotyping  in  lung  cancer  cell 
lines).  We  know  that  DNA  copy  number  analyses  of  tumors 
have  limitations  due  to  the  phenomenon  that  most  of  these 
alterations  in  invasive  cancers  are  large,  spanning  many 
genes  (tens  to  hundreds),  including  many  that  likely  are 
not  involved  in  oncogenesis.  Therefore,  studies  of  premalig¬ 
nant  lesions  with  known  outcomes  are  necessary  to  better 
define  chromosomal  loci  and  genes  responsible  for  tumor  de¬ 
velopment  in  humans. 

The  reports  of  Tsui  et  al.  (14)  and  Li  et  al.  (15)  on  the  tumor 
models  oral  squamous  cell  carcinoma  and  Barrett's  esophageal 
adenocarcinoma  give  new  insights  into  and  hope  for  under¬ 
standing  the  highly  complex  nature  of  the  progression  of  pre¬ 


malignancy  to  cancer.  Despite  their  different  pathogenetic 
features,  these  two  tumors  shared  some  common  genetic  pro¬ 
gression  characteristics.  The  application  of  high-throughput 
DNA  technology  in  these  models  showed  the  promise  of  this 
approach  and  the  merit  of  extending  these  methods  to  other 
premalignant  diseases.  It  also  showed  the  potential  of  CGH 
and  SNP  arrays  to  identify  new  candidate  biomarkers  and 
measures  of  clonal  diversity,  both  of  which  can  be  used  in  pa¬ 
tient  management  and  assessment  of  cancer  risk.  Is  less  more? 
High-throughput  approaches  generate  increasing  amounts  of 
data  from  smaller  and  smaller  tissue  samples,  and  chromoso¬ 
mal  deletions  appear  to  increase  progression  of  prema¬ 
lignancy.  These  examples  of  "less  is  more"  represent 
important  advances  in  our  understanding  of  the  molecular 
nature  of  carcinogenesis. 
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Abstract 

The  use  of  tyrosine  kinase  inhibitors  (TKI)  has  yielded  great 
success  in  treatment  of  lung  adenocarcinomas.  However, 
patients  who  develop  resistance  to  TKI  treatment  often 
acquire  a  somatic  resistance  mutation  (T790M)  located  in 
the  catalytic  cleft  of  the  epidermal  growth  factor  receptor 
(EGFR)  enzyme.  Recently,  a  report  describing  EGFR-T790M  as 
a  germ-line  mutation  suggested  that  this  mutation  may  be 
associated  with  inherited  susceptibility  to  lung  cancer. 
Contrary  to  previous  reports,  our  analysis  indicates  that  the 
T790M  mutation  confers  increased  Y992  and  Y1068  phosphor¬ 
ylation  levels.  In  a  human  bronchial  epithelial  cell  line, 
overexpression  of  EGFR-T790M  displayed  a  growth  advantage 
over  wild-type  (WT)  EGFR.  We  also  screened  237  lung  cancer 
family  probands,  in  addition  to  45  bronchoalveolar  tumors, 
and  found  that  none  of  them  contained  the  EGFR-T790M 
mutation.  Our  observations  show  that  EGFR-T790M  provides 
a  proliferative  advantage  with  respect  to  WT  EGFR  and 
suggest  that  the  enhanced  kinase  activity  of  this  mutant  is  the 
basis  for  rare  cases  of  inherited  susceptibility  to  lung  cancer. 
[Cancer  Res  2007;67(10):4665-70] 

Introduction 

Recent  work  has  identified  a  series  of  somatic  mutations  in  exons 
18  to  21  of  epidermal  growth  factor  (EGF)  receptor  (EGFR)  that 
render  lung  tumors  responsive  to  the  gefitinib  and  erlotinib 
therapeutics  (1-3).  However,  in  patients  that  progress  after  drug 
treatment,  it  has  been  observed  that  a  secondary  “resistance” 
mutation  is  often  acquired  in  exon  20  (4-6).  This  mutation,  T790M, 
arises  somatically  in  ~  50%  of  these  cases  (4-6).  In  chronic 
myelogenous  leukemia  (CML)  patients,  an  estimated  50%  to  90% 
of  tumors  with  acquired  resistance  have  the  analogous  resistance 
mutation  (T315I)  in  BCR-ABL  (7).  Interestingly,  Bell  et  al.  (8)  report 
the  first  identification  of  the  T790M  mutation  in  the  germ  line  of 
a  European  family  that  developed  lung  adenocarcinoma  with 
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bronchoalveolar  (BAC)  differentiation.  This  observation  suggests 
that  this  mutation  may  be  associated  with  genetic  susceptibility  to 
lung  cancer  and  may  underlie  familial  predisposition  to  the  disease. 
The  allele,  although  common  in  drug-treated  tumors,  seems 
extremely  rare  in  the  germ  line  of  the  general  population,  as  the 
authors  report  no  mutation  observed  in  782  alleles  sequenced.  Many 
groups  have  reported  that  the  kinase  activity  of  the  EGFR-T790M- 
resistant  mutant  is  indistinguishable  from  wild-type  (WT)  EGFR 
(4-6,  9,  10).  What  is  somewhat  perplexing  is  how  T790M  would 
confer  susceptibility  if  its  activity  were  identical  to  the  WT  molecule. 
Our  analysis  shows  that  T790M  in  fact  does  exhibit  enhanced 
autophosphorylation  at  Y992  and  Y1068,  and  this  mutation  is 
associated  with  a  proliferative  advantage  in  a  human  bronchial 
epithelial  cell  line.  Interestingly,  the  mutation  seems  to  be  rare  as  it 
was  not  found  in  any  of  the  familial  or  sporadic  lung  cancer 
populations  we  screened. 

Materials  and  Methods 

Cell  lines.  HEK293T,  HEK293FT,  and  COS-7  cells  were  maintained  in 
DMEM  supplemented  with  10%  fetal  bovine  serum.  Immortalized  human 
bronchial  epithelial  cells  (HBEC3-KT)  were  maintained  in  keratinocyte 
serum-free  medium  (with  50  pg/mL  bovine  pituitary  extract  and  5  ng/mL 
EGF).  All  cells  were  grown  at  37°  C  in  a  humidified  incubator  with  5%  C02. 

Plasmids,  transfection,  and  viral  infection.  Mammalian  expression 
plasmids  encoding  for  human  EGFR,  EGFR-T790M,  and  EGFR-L858R  were 
kind  gifts  from  William  Pao  (Memorial  Sloan-Kettering  Cancer  Center,  New 
York,  NY).  HEK293T  and  COS-7  cells  were  transfected  with  the  indicated 
EGFR  plasmids  using  LipofectAMINE  2000  (Invitrogen).  Gefitinib  was 
obtained  from  Chemoprevention  Branch,  National  Cancer  Institute 
(Bethesda,  MD)  and  added  to  cells  10  h  before  lysis. 

To  introduce  WT  and  mutant  EGFRs  into  HBEC3  cells,  we  used  the 
pLenti6/V5  Directional  TOPO  Cloning  kit  (Invitrogen).  Construction  of 
pLenti-wt-EGFR  was  described  previously  (11).  The  T790M  and  L747_E749 
deletion  mutations  were  introduced  into  pLenti-wt-EGFR  by  using  site- 
directed  mutagenesis  (Stratagene).  pLenti-KRASV12  vector  was  constructed 
by  cloning  KRASV12  fragment  from  pBabe-KRASV12-hyg  (11)  into  pLenti6/ 
V5  vector.  pLenti6/V5-GW/ lacZ  (Invitrogen)  was  used  as  a  control.  Viral 
transduction  of  HBEC3  cells  was  done  following  the  manufacturer’s 
instructions.  Briefly  the  293FT  cells  were  transiently  transfected  with  viral 
vector  together  with  ViraPower  (Invitrogen).  Forty-eight  hours  after 
transfection,  the  supernatant  of  the  293FT  cells  was  harvested  and  passed 
through  a  0.45-pm  filter  and  frozen  at  —  80  °C.  The  supernatant  was  used  for 
infection  after  addition  of  4  pg/mL  polybrene  (Sigma).  Forty-eight  hours 
after  infection,  drug  selection  of  infected  cells  was  started  with  5  pg/mL 
blasticidin  (Invitrogen)  and  continued  for  7  days. 
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Colony  formation  assays.  Liquid  colony  formation  assays  were  done  as 
described  previously  (11).  Briefly,  200  viable  cells  were  plated  in  triplicate  on 
100-mm  plates  and  cultured  in  keratinocyte  serum-free  medium  supple¬ 
mented  with  50  pg/mL  of  bovine  pituitary  extract  without  EGF.  Surviving 
colonies  were  counted  10  days  later  after  staining  with  methylene  blue,  and 
colonies  >3  mm  in  diameter  were  counted. 

Western  blot  and  antibodies.  Cell  lysates  were  prepared  in  LDS  sample 
buffer  (Invitrogen)  and  electrophoresed  on  NuPAGE  gels  (Invitrogen). 
Protein  was  transferred  to  polyvinylidene  difluoride  membranes  and  blotted 
using  the  antibodies  as  indicated:  anti-EGFR  (Cell  Signaling  Technology), 
anti-EGFR-Y1068  (Cell  Signaling  Technology),  anti-EGFR-Y992  (Cell  Sig¬ 
naling  Technology),  anti-mitogen-activated  protein  kinase  (MAPK;  Cell 
Signaling  Technology),  anti-phosphorylated  MAPK  (Thr202/Tyr204;  Cell 
Signaling  Technology),  anti-phosphorylated  AKT  (Ser473;  Cell  Signaling 
Technology),  anti-AKT  (Cell  Signaling  Technology),  anti-a-tubulin  (Santa 
Cruz  Biotechnology),  and  anti-cyclin  D1  (Santa  Cruz  Biotechnology). 

EGFR  exon  20  genotyping.  Amplification  of  human  EGFR  exon  20 
was  done  via  standard  PCR  methods  using  forward  (5  -GACACTGACG- 
TGCCTCTCC-3')  and  reverse  (5'-TTATCTCCCCTCCCCGTATC-3')  primers. 
PCR  products  were  electrophoresed  on  agarose  gels,  purified,  and  subjected 
to  standard  DNA  sequencing.  The  EGFR-T790M  mutation  is  deduced  by  the 
genotype  (C/T)  at  position  86  in  exon  20. 

Lung  cancer  DNAs.  The  Genetic  Epidemiology  of  Lung  Cancer 
Consortium  (GELCC)  has  accrued  over  700  families  with  three  or  more  first- 
degree  relatives  with  lung  cancer  (12),  of  which  this  study  geno typed  237 
individual  probands.  Paraffin  blocks  of  lung  tumors  (45)  with  BAC 
differentiation  (and  32  corresponding  normals),  in  addition  to  the  fresh 


frozen  lung  tumor  DNAs,  were  obtained  from  the  Washington  University 
Tissue  Procurement  Center  as  paraffin  blocks.  For  the  blocks,  genomic  DNA 
was  prepared  by  slicing  several  paraffin  curls  and  adding  xylenes  to  dissolve 
the  paraffin.  The  remaining  tissue  was  washed  with  ethanol,  dried,  and 
resuspended  in  PCR  buffer  (0.5%  Tween  20,  0.5%  NP40)  and  digested  in 
proteinase  K  (1  mg/mL)  at  55° C  overnight.  Chloroform/isoamyl  alcohol  (24:1) 
was  added  to  the  digest  and  soluble  DNA  was  isolated  from  the  aqueous  phase. 

Results 

To  address  previous  observations  that  EGFR-T790M  activity  is 
the  same  as  the  WT  molecule,  we  did  transient  transfections  of 
EGFR  and  EGFR-T790M  expression  constructs  in  COS-7  and  293T 
cells.  Overexpression  of  EGFR  and  the  T790M  mutant  in  these  cell 
lines  clearly  indicated  that  the  phosphorylation  status  at  Y1068  is 
enhanced  with  respect  to  WT  (Fig.  1A-E).  Tyr1068  is  an  EGFR 
autophosphorylation  site  that  couples  receptor  activation  to  Ras 
signaling  (13).  Interestingly,  our  observations  are  in  disagreement 
with  previously  published  reports  (4-6,  9,  10).  Our  data  also  show 
that  phosphorylation  levels  between  WT  and  T790M  are  indistin¬ 
guishable  at  higher  expression  levels  (Fig.  1C),  yet  differences  are 
more  apparent  at  lower  expression  levels.  As  a  control,  we  also 
showed  the  drug-resistant  nature  of  EGFR-T790M  in  comparison 
with  WT  EGFR  (Fig.  ID).  We  further  addressed  the  activation 
status  of  MAPK,  a  downstream  molecule  phosphorylated  and 
activated  by  EGFR.  We  observed  that  phosphorylated  MAPK 
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Figure  1.  Phosphorylation  of  Y1068  is 
enhanced  in  EGFR-T790M.  FIEK293 
cells  (A)  and  COS-7  cells  ( B )  were 
transfected  with  the  indicated  DNAs  and 
lysed  48  h  after  transfection.  Lysates 
were  immunoblotted  with  anti-EGFR 
and  anti-EGFR-Y1068  as  indicated. 
Phosphorylation  of  Y1068  is  enhanced 
in  the  EGFR-T790M  mutant. 

C,  phosphorylation  and  activation  of 
EGFR-T790M  are  evident  at  lower  levels 
of  protein  expression  compared  with 
higher  levels  in  COS-7  cells  ( lane  3  versus 
lane  5).  D,  gefitinib  does  not  affect  the 
phosphorylation  status  of  T790M-EGFR. 

E,  phosphorylation  of  MAPK  is  enhanced 
in  EGFR-T790M-expressing  (versus 
EGFR-WT  expressing)  cells.  Cells  were 
stimulated  with  EGF  (10  ng/mL)  for  5  min. 


Cancer  Res  2007;  67:  (10).  May  15,  2007 


4666 


www.aacrjournals.org 


EGFR-T790M  Is  a  Rare  Lung  Cancer  Susceptibility  Allele 


a-cyclin  D1  »  —  ^ 


a-tubulin 


Figure  2.  Colony  formation  growth  assay  for  EGFR-T790M.  Viral  transduction  and  selection  of  HBEC  cells  stably  expressing  EGFR  were  done  as  described  previously 
(11).  A,  cells  (200)  were  plated  in  triplicate  and  grown  in  keratinocyte  serum-free  medium  (with  50  |ag/mL  bovine  pituitary  extract  and  without  EGF).  After  10  d, 
cells  were  stained  with  methylene  blue.  B,  stained  cell  colonies  >3  mm  in  diameter  were  counted.  *,  P  <  0.0001,  two-tailed  t  test.  C,  Western  blots  for  EGFR, 
EGFR-Y1068,  EGFR-Y992,  MAPK,  MAPK  (T202/Y204),  AKT,  AKT  (S473),  cyclin  D1,  and  a-tubulin  in  the  HBEC  stable  cell  lines  were  done  as  indicated. 
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(Thr202/Tyr204)  levels  were  enhanced  in  the  T790M  mutant 
compared  with  WT  EGFR  (Fig.  IE). 

We  also  tested  whether  EGFR-T790M  expression  in  an  immor¬ 
talized  human  bronchial  epithelial  cell  (HBEC3)  line  affected 


growth  properties  of  these  cells.  Cells  were  transduced  by  lentivirus 
expressing  EGFR  and  EGFR  mutants  and  selected  for  stable 
integration  as  done  previously  (11).  Two  hundred  cells  were  seeded 
on  plates  without  EGF  supplementation  and  allowed  to  grow  for 
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Figure  3.  EGFR  genotyping  and  linkage  analysis  for  chromosome  7  with  SimWalk2.  A,  the  EGFR-T790M  mutation  was  not  observed  in  familial  lung  cancer 
blood  DNA,  BAC  tumors  and  normals,  or  fresh-frozen  resected  tumors.  B,  the  genetic  position  for  EGFR  is  at  ~73  cM  (based  on  interpolation),  between  markers 
D7S1818  (70  cM)  and  D7S3046  (79  cM).  Simwalk2  analysis  was  based  on  the  parametric  method  using  the  genetic  model  reported  by  Moscatello  et  al.  (18).  Fifty-five 
pedigrees  with  five  or  more  affected  members  were  used.  C  and  D,  two  representative  pedigrees  of  GELCC  collected  families  with  predisposition  to  lung  cancer. 

In  (C)  and  (D),  •  (females)  or  ■  (males)  represent  lung,  throat,  or  laryngeal  cancer.  Numbers  below  each  individuals,  sample  numbers;  numbers  in  brackets,  individual 
ages  at  the  time  the  pedigree  was  constructed;  slashes,  dead  individuals. 
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10  days.  We  observed  that  the  EGFR-T790M-expressing  cells  had  a 
greater  number  of  large  (>3  mm  in  diameter)  colonies  compared 
with  WT  EGFR  but  less  than  the  activated  L747-E749  deletion 
(Fig.  2A  and  B ).  These  results  indicate  that  EGFR-T790M  does  in 
fact  provide  a  growth  advantage  over  WT  EGFR.  When  we  analyzed 
the  status  of  EGFR  phosphorylation  at  Y992  and  Y1068  in  these  cells, 
we  saw  an  increase  in  the  EGFR-T790M  mutant  (Fig.  2C),  consistent 
with  what  was  observed  in  HEK293T  and  COS-7  cells  (Fig.  1A-D). 

To  address  possible  downstream  signaling  mechanisms  that  are 
affected  by  mutant  EGFR,  we  looked  at  the  phosphorylation/ 
activation  status  of  MAPK  and  AKT  (Fig.  2C).  In  contrast  to  our 
observations  in  293T  cells,  we  did  not  see  MAPK  activation  induced 
by  T790M.  However,  this  is  consistent  with  previously  published 
results  where  no  obvious  changes  are  seen  with  the  deletion 
mutant,  yet  colony  formation  is  enhanced  (11).  Transcriptional 
changes,  such  as  up-regulation  of  cyclin  D1  levels,  were  recently 
reported  to  be  a  result  of  EGFR  mutant  signaling  (14);  however,  we 
observed  no  changes  induced  by  the  mutants  (Fig.  2C). 

If  EGFR-T790M  is  a  human  susceptibility  allele,  it  is  necessary  to 
determine  its  prevalence  in  families  with  high  susceptibility  to  lung 
cancer.  With  these  families,  a  set  of  52  extended  pedigrees  was  used 
to  show  linkage  on  chromosome  6q23-25  (12).  A  concerted  effort  by 
the  GELCC  has  attempted  to  identify  the  gene(s)  associated  with 
this  susceptibility.  We  proposed  that  if  the  T790M  mutation  is  a 
common  variant  for  predisposition  to  lung  cancer,  it  could 
potentially  be  enriched  in  probands  from  lung  cancer  families. 
Genomic  DNA  from  237  probands  representing  lung  cancer 
families  with  more  than  three  affected  individuals  was  analyzed. 
PCR  amplification  and  DNA  sequencing  of  the  resistance  mutation 
in  exon  20  were  done.  Interestingly,  we  did  not  observe  the  T790M 
mutation  in  any  of  the  family  probands  analyzed  or  in  any  of  60 
random  fresh-frozen  resected  lung  tumors  (Fig.  3 A).  This  would 
suggest  that  the  T790M  mutation  is  not  enriched  in  our  population 
and  is  likely  a  minor  contributor  to  genetic  susceptibility  in  familial 
lung  cancers.  Furthermore,  in  the  aforementioned  GELCC  linkage 
study,  the  analysis  of  52  families  (and  family  subsets  within)  did  not 
reveal  a  significant  logarithm  of  odds  score  on  or  near  7pll,  where 
the  EGFR  gene  is  located,  which  also  suggests  that  the  mutation  is 
not  a  major  contributor  to  familial  predisposition  to  lung  cancer 
(Fig.  3 B;  ref.  12).  Representative  families  used  in  this  study  are 
indicated  in  Fig.  3C  and  D.  We  further  tested  to  see  if  T790M  might 
in  fact  be  solely  responsible  for  predisposition  to  adenocarcinomas 
with  BAC  differentiation.  We  sequenced  45  BAC  tumors  and  32  of 
the  corresponding  normal  tissues  and  did  not  observe  the  T790M 
mutation  in  any  of  these  samples  (Fig.  3 A).  This  further  suggests 
the  rarity  of  this  potential  predisposing  mutation. 

Discussion 

Several  lines  of  evidence  exist  that  lend  support  to  T790M  as  a 
putative  susceptibility  allele.  T790M  has  been  detected  somatically 
in  tumors  that  have  not  been  treated  with  gefitinib  or  erlotinib, 
which  may  suggest  that  this  mutant  may  have  arisen  during  drug- 
free  cancer  progression  (15).  Furthermore,  the  NCI-H1975  BAC  cell 
line,  which  has  never  undergone  tyrosine  kinase  inhibitor 
treatment,  has  both  activating  L858R  and  resistant  T790M  muta¬ 
tions,  suggestive  that  T790M  may  be  growth  promoting  (5).  Bell  et 
al.  (8)  also  have  observed  that  the  T790M  mutation  seems  to  occur 
in  cis  with  the  activating  mutations.  Perhaps  more  persuasive  is  that 
the  analogous  resistance  mutation  in  CML  patients,  BCR-ABL- 
T315I,  displays  increased  in  vitro  kinase  activity  (16).  Similarly,  the 
analogous  mutations  in  Src  (T341M)  and  FGFR1  (V561M)  also  result 


in  increased  phosphorylation  and  activation  (10).  Why  T790M  in 
EGFR  does  not  reportedly  function  in  a  similar  manner  is  unclear. 

Our  results  suggest  that  the  T790M  mutation  may  in  fact  provide 
a  proliferative  advantage  in  normal  cells  by  increasing  kinase 
activity  and  downstream  signaling.  Our  data  suggest  that  EGFR- 
T790M  does  in  fact  exhibit  higher  kinase  activity  than  WT  molecule 
in  HEK293T  and  COS-7  cell  lines.  Overexpression  in  our  HBEC3  cell 
line  showed  increased  tyrosine  phosphorylation  at  Y992  and  Y1068 
and  increased  colony  formation.  Increased  proliferation  is  most 
evident  in  the  absence  of  EGF  in  the  medium,  suggesting  that  the 
activating  T790M  EGFR  mutation  is  involved  in  the  proliferative 
effects  observed.  EGFR  kinase  activity  is  essential  for  oncogenic 
transformation.  EGFR  extracellular  domain  deletion  mutants  and 
over  expression  of  EGFR  are  commonly  found  in  human  cancers 
(17,  18).  Our  data  also  show  that  phosphorylation  levels  between 
WT  and  T790M  are  indistinguishable  at  higher  expression  levels 
(Fig.  1C),  yet  differences  are  more  apparent  at  lower  expression 
levels.  We  believe  this  may  be  an  explanation  why  others  have  seen 
no  difference  between  EGFR  and  T790M. 

EGFR  signaling  activates  many  pathways  that  lead  to  proliferative 
advantages.  The  Ras/MAPK  pathway  is  activated  via  ligation  of  Grb2 
to  an  activated  EGFR  molecule.  In  our  293T  overexpression  system, 
we  observed  that  MAPK  activity  is  increased  by  T790M  versus  WT 
EGFR.  However,  a  significant  effect  of  T790M  on  these  downstream 
kinases  was  not  evident  in  our  HBEC3  system,  which  might  suggest 
that  other  signaling/ transcriptional  events  are  responsible  for  the 
proliferative  changes.  Recent  work  has  revealed  transcriptional 
changes  in  cyclin  Dl,  maybe  a  key  response  to  gefitinib  resistance  by 
T790M  and  susceptibility  to  the  irreversible  inhibitor  CL-387,785 
(14).  We  did  not  observe  changes  in  cyclin  Dl  levels  in  the  HBEC3 
system  and  believe  other,  yet  unidentified,  mechanisms  exist  to 
account  for  the  proliferative  advantages  caused  by  mutant  EGFR. 

A  recent  study,  using  a  mutant-enriched  PCR  system,  revealed 
the  presence  of  the  T790M  mutation  as  a  minor  clone  in  non-small 
cell  lung  cancer  tumors  (19).  It  is  suggested  that  these  clones  are 
selected  for  during  gefitinib  treatment  and  are  enriched  in  the 
resistant  tumor  but  do  not  provide  the  main  proliferative  function 
for  oncogenesis.  Our  observations  suggest  that  T790M  as  a  germ¬ 
line  mutation  (i.e.,  in  all  cells)  may  provide  the  mild  proliferative 
push  for  lung  cancer  development. 

Many  lines  of  evidence  suggest  the  existence  of  a  limited  number 
of  genetic  factors  that  control  susceptibility  to  lung  cancer  (20-22). 
However,  due  to  a  high-case  fatality  rate  (5-year  survival  rate  of  15%), 
obtaining  biospecimen  samples  for  DNA  analysis  is  particularly 
difficult  (23).  A  collaborative  effort  of  GELCC  has  accrued  DNA  from 
families  with  lung,  throat,  and  laryngeal  cancers  since  the  early 
1990s.  At  present,  771  families  with  three  or  more  first-degree 
relatives  affected  with  lung  cancer  have  been  collected,  of  which  1 1% 
have  sufficient  family-wide  biospecimen  availability  for  any  future 
studies.  Subsequent  sequencing  of  237  families  with  predisposition 
to  lung  cancer,  45  BAC  tumors  and  60  fresh-frozen  resected  tumors, 
did  not  reveal  any  mutations,  suggesting  that  T790M  is  likely  a  rare 
mutation.  Nevertheless,  our  data  provide  a  basis  for  EGFR-T790M  as 
a  rare  susceptibility  allele  in  human  lung  cancer. 
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Impact  of  Smoking  Cessation  on  Global  Gene  Expression  in  the 
Bronchial  Epithelium  of  Chronic  Smokers 
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Abstract  Cigarette  smoke  is  the  major  cause  of  lung  cancer  and  can  interact  in  complex  ways  with 
drugs  for  lung  cancer  prevention  or  therapy.  Molecular  genetic  research  promises  to  eluci¬ 
date  the  biological  mechanisms  underlying  divergent  drug  effects  in  smokers  versus  non- 
smokers  and  to  help  in  developing  new  approaches  for  controlling  lung  cancer.  The 
present  study  compared  global  gene  expression  profiles  (determined  via  Affymetrix  micro¬ 
array  measurements  in  bronchial  epithelial  cells)  between  chronic  smokers,  former  smokers, 
and  never  smokers.  Smoking  effects  on  global  gene  expression  were  determined  from  a 
combined  analysis  of  three  independent  data  sets.  Differential  expression  between  current 
and  never  smokers  occurred  in  591  of  13,902  measured  genes  (P  <  0.01  and  >2-fold 
change;  pooled  data) — a  profound  effect.  In  contrast,  differential  expression  between  cur¬ 
rent  and  former  smokers  occurred  in  only  145  of  the  measured  genes  (P  <  0.01  and  >2-fold 
change;  pooled  data).  Nine  of  these  145  genes  showed  consistent  and  significant  changes 
in  each  of  the  three  data  sets  (P  <  0.01  and  >2-fold  change),  with  eight  being  down-regulated 
in  former  smokers.  Seven  of  the  eight  down-regulated  genes,  including  CYP1B1  and  three 
AKR  genes,  influence  the  metabolism  of  carcinogens  and/or  therapeutic/chemopreventive 
agents.  Our  data  comparing  former  and  current  smokers  allowed  us  to  pinpoint  the  genes 
involved  in  smoking-drug  interactions  in  lung  cancer  prevention  and  therapy.  These  findings 
have  important  implications  for  developing  new  targeted  and  dosing  approaches  for  pre¬ 
vention  and  therapy  in  the  lung  and  other  sites,  highlighting  the  importance  of  monitoring 
smoking  status  in  patients  receiving  oncologic  drug  interventions. 


Chronic  cigarette  smoking  is  the  major  cause  of  lung  cancer 
and  remains  so  for  years  even  after  smoking  cessation  (1,  2). 
Therefore,  the  development  of  agents  for  controlling  lung  can¬ 
cer  generally  targets,  virtually  by  default,  current  and  former 
heavy  smokers.  Smoking  status,  however,  seems  to  influence 
response  to  various  chemopreventive  and  chemotherapeutic 
agents  and  clinical  outcomes  of  their  use  (3,  4).  Three  large 
randomized  clinical  trials  to  prevent  lung  cancer — the 
Alpha-Tocopherol,  Beta-Carotene  Prevention  Study  (5),  Caro¬ 
tene  and  Retinol  Efficacy  Trial  (6),  and  Lung  Intergroup  Trial 
(7) — showed  that  current  heavy  smokers  had  harmful  interac¬ 
tions  (higher  lung  cancer  mortality,  incidence,  and  recurrence) 
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with  preventive  agents  (versus  control  arms);  agent  effects  in 
former  smokers  were  generally  neutral  and  were  not  readily  in¬ 
terpretable  in  never  smokers  because  of  the  exclusion  or  very 
limited  number  of  these  patients  in  these  trials.  Certain  lung 
cancer  therapy  regimens  have  been  shown  to  be  less  effective 
in  current  smokers  than  in  former  and  never  smokers  (8,  9). 
Smoking  can  stimulate  the  metabolic  clearance  of  targeted 
anticancer  therapies,  undoubtedly  diminishing  therapeutic 
benefit  (9,  10).  These  data  highlight  the  importance  of  under¬ 
standing  the  biological  effect  of  chronic  smoking  on  lung  tissue. 

To  understand  why  smokers  and  former  smokers  have  dif¬ 
ferential  responses  to  agents  for  preventing  or  treating  lung 
cancer,  we  analyzed  and  compared  global  gene  expression 
profiles  in  three  independent  cancer-free  cohorts  comprising 
current,  former,  and  never  smokers. 

Materials  and  Methods 

Study  population 

This  study  included  current  smokers,  former  smokers,  and  never 
smokers  with  no  evidence  of  cancer  and  collected  from  separate,  inde¬ 
pendent  studies  conducted  at  The  University  of  Texas  M.  D.  Anderson 
Cancer  Center  (MDACC;  two  studies)  and  the  Boston  Medical  Center 
(BMC;  one  study).  The  three  data  sets  associated  with  the  three  studies 
are  called  MD ACC-1,  MD ACC-2,  and  BMC  throughout  this  article. 
Former  smoking  was  defined  as  having  quit  smoking  for  at  least 
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12  months  before  study  entry.  Participants  included  in  the  MD ACC-1 
and  MD  ACC-2  data  sets  came  from  the  placebo  arm  of  an  ongoing  che- 
moprevention  trial  done  at  M.  D.  Anderson  Cancer  Center.  All  MDACC 
subjects  were  clinically  free  of  cancer  at  enrollment  and  underwent  a 
bronchoscopy  at  baseline.  Bronchial  brushes  were  done  at  six  predeter¬ 
mined  sites  including  the  entry  area  at  each  of  the  five  main  lobes  and 
the  carina,  as  previously  described  (11).  The  study  was  approved  by  the 
MDACC  Institutional  Review  Board,  and  all  MDACC  participants  gave 
signed  informed  consent.  BMC  data  set  participants  had  a  broncho¬ 
scopy  at  the  BMC  and  were  analyzed  in  a  previously  reported  study 
(12)  as  well  as  in  the  current  study.  Potential  subjects  for  the  MDACC 
or  BMC  data  sets  in  the  current  study  were  excluded  if  their  specimen 
images  (produced  as  discussed  below  in  "cRNA  preparation  and  micro¬ 
array  hybridization")  had  defects,  evidence  of  blood  contamination,  or 
other  problems  that  did  not  meet  image  quality  criteria  applied  consis¬ 
tently  across  all  three  data  sets.  All  MDACC  patients  had  smoking  his¬ 
tory,  with  average  pack-years  of  40.6  (±13).  Their  average  age  is  58  (±8) 
years.  Sixty-one  percent  of  them  are  male  and  78%  of  them  are  White. 
More  details  about  the  demographic  data  of  MDACC  data  sets  can  be 
found  in  Supplementary  Table  S2. 

Bronchial  brush  processing  and  RNA  extraction 

For  samples  in  MD  ACC-1  and  MD  ACC-2,  brushes  were  placed  on 
bronchoscopy  in  3  mL  of  plain  DMEM  culture  (Life  Technologies,  Inc.) 
in  sterile  tissue  culture  tubes  and  stored  at  4°C  for  processing  the  same 
day.  The  tubes  were  vortexed  lightly  to  detach  cells  from  the  brushes. 
After  removal  of  the  brush  from  the  tube,  the  cell  suspension  was  cen¬ 
trifuged  at  2,500  rpm  for  5  min.  Cell  pellets  were  then  washed  with 
2  mL  of  PBS  twice,  and  an  aliquot  of  material  was  saved  at  -80°C  until 
RNA  extraction.  For  the  microarray  analysis,  cells  from  the  six  brush¬ 
ing  sites  of  the  same  individual  were  pooled  together  for  RNA  extrac¬ 
tion.  We  used  TRIzol  reagent  (Invitrogen)  for  total  RNA  extraction 
according  to  the  manufacturer's  protocol,  with  a  yield  of  1  to  4  jag  of 
total  RNA  per  sample.  Integrity  of  the  RNA  was  confirmed  by  running 
it  on  an  RNA  6000  Neno  LabChip  (Agilent  Technologies).  The  samples 
in  BMC  data  set  were  processed  similarly  as  described  (12),  except  that 
a  single-round  amplification  protocol  was  used. 

cRNA  preparation  and  microarray  hybridization 

The  first  and  second  cDNA  strands  were  synthesized  as  previously 
described  (13).  The  first  reverse  transcription  was  done  in  the  absence 
of  biotin-labeled  ribonucleotides,  resulting  in  unlabeled  cRNA,  which 
was  then  used  as  starting  material  for  the  second  cycle.  In  the  second 
cycle,  the  first  and  second  cDNA  strands  were  synthesized.  The 
second  transcription  was  done  in  the  presence  of  biotin-labeled- 
ribonucleotides,  resulting  in  labeled  cRNA.  The  cRNA  was  fragmen¬ 
ted  and  checked  by  gel  electrophoresis,  as  reported  earlier  (13). 
The  Affymetrix  GeneChip  system  was  used  for  hybridization,  stain¬ 
ing,  and  imaging  of  the  probe  arrays.  Hybridization  cocktails  of 
300  pL,  each  containing  15  gg  of  cRNA  and  exogenous  hybridization 
controls,  were  prepared  as  previously  described  and  hybridized  to 
U133A  or  U133A  plus  GeneChips  (Affymetrix)  overnight  at  42°C. 
Hybridized  fragments  were  detected  with  streptavidin  linked  to  phy- 
coerythrin  (Molecular  Probes).  GeneChips  were  scanned  and  imaged 
using  Affymetrix  Microarray  Analysis  Suite  version  5.0. 

Microarray  data  normalization 

There  were  two  array  types  used  in  this  study:  U133A  and  U133 
Plus  2.  The  U133A  array  contains  -500,000  distinct  probe  features  in¬ 
terrogating  18,400  human  transcripts  and  variants,  including  13,902 
well-characterized  genes.  The  U133  plus  2.0  array  contains  all  probe 
features  that  are  on  the  U133A  array.  In  addition,  there  are  9,921 
new  probe  sets  representing  6,500  new  genes.  To  facilitate  straightfor¬ 
ward  comparison  of  the  data,  we  used  only  the  probes  that  are  com¬ 
mon  to  both  array  types.  We  also  ignored  data  of  MM  probes.  We  used 
PM  probes  common  to  U133A  and  U133  Plus  2  arrays  to  perform 
quantile  normalization  on  the  probe  level  data  (14).  The  procedure 


was  done  so  that  the  distributions  of  the  probe  signal  intensities  of  a 
sample  are  identical  for  all  samples  within  a  data  set.  Then,  we  used 
PDNN  model  (15)  to  quantify  the  gene  expression  values  from  the 
normalized  probe  signal  intensity  data.  We  then  applied  median¬ 
centering  normalization  on  the  probe  set  level  data  so  that  the  median 
of  expression  values  of  a  sample  was  made  to  be  the  same  for  all  the 
samples  in  all  of  the  data  sets. 

Identification  of  differentially  expressed  genes 

Differential  expression  was  identified  to  be  similar  to  that  described 
by  Wang  et  al.  (16).  We  used  Z  values  (defined  below)  to  assess  differ¬ 
entially  expressed  genes  between  current  and  never  or  former  smo¬ 
kers,  in  whom  the  magnitude  of  the  Z  values  is  assumed  to 
represent  the  effect  of  smoking  cessation.  For  a  given  data  set  contain¬ 
ing  nA  and  nB  samples  in  groups  A  and  B,  respectively,  we  compute 
the  following  test  statistic  Z  for  each  probe  set: 

Z  =  D/o  (A) 

where  D  is  average  difference  between  the  log  expression  values  be¬ 
tween  A  and  B  groups,  cr  is  the  estimated  standard  deviation  (SD)  of 
D: 

u  =  \J  oa2  /nA  +  o2B/nB  (B) 

where  <ja  and  cr|  are  estimated  variances  of  log  expression  values  in 
groups  A  and  B,  respectively.  These  variances  were  estimated  using 
Loess  fit  between  the  mean  log  expression  values  and  the  SD  of  the 
log  expression  values.  The  underlying  assumption  is  that  the  mean 
and  the  SD  are  related  by  a  smooth  function,  which  allows  the  analysis 
method  to  treat  the  SD  as  if  it  were  known. 

Combining  Z  values  from  different  data  sets 

The  Z  values  obtained  from  the  three  data  sets  can  be  combined 
using  the  following  formula: 

Z  =  (Z\/ -\- Z2I +  Z%/ o\) / w2  (C) 

co2  =  1/cj2  +  1/ <j\  +  1/ 0-3  (D) 

where  Zlr  Z2,  and  Z3  were  calculated  using  Eq.  A  from  MD  ACC-1, 
MD  ACC-2,  and  BMC  data  sets,  respectively;  oq,  cr  2/  and  cr  3  were  cal¬ 
culated  using  Eq.  B  from  MD  ACC-1,  MD  ACC-2,  and  BMC  data  sets, 
respectively. 

The  test  statistic  Z  is  supposed  to  form  a  T  distribution  if  the  log  expres¬ 
sion  values  are  normally  distributed.  However,  the  observed  data  slightly 
deviate  from  the  normal  distribution  because  they  contain  more  extreme 
values.  Consequently,  the  significance  of  Z  can  be  overestimated. 

To  alleviate  the  bias  due  to  the  assumption  of  normal  distribution, 
we  used  permuted  data  to  compute  Z*.  The  expression  values  are  ran¬ 
domly  permuted  for  each  probe  set  within  each  data  set.  The  permu¬ 
tation  was  done  10  times  to  construct  an  empirical  cumulative 
distribution  function  of  Z*.  This  distribution  was  assumed  to  be 


Table  1.  Sample  sizes  and  array  types  of  the 
microarray  data  sets 


Data  set 

FS 

CS 

NS 

Array  type 

MDACC-1 

7 

11 

0 

U133A 

MDACC-2 

8 

15 

0 

U133  Plus  2 

BMC 

9 

30 

19 

U133A 

Abbreviations:  FS,  former  smoker;  CS,  current  smoker;  NS, 
never  smoker. 
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Fig.  1.  Comparison  of  Z  values  obtained  from  the  three  data  sets 
(BMC,  MDACC-1,  and  MDACC-2).  Each  point  in  these  scatter  plots  represents 
a  probe  set.  The  probe  sets  with  absolute  Z  values  >5  in  all  three  data  sets 
are  shown  in  red.  Detailed  data  on  these  probe  sets  are  in  Table  2. 


the  distribution  of  Z  values  under  null  hypothesis  (i.e.,  no  differential 
expression),  and  it  was  used  to  estimate  the  P  values  and  the  false  dis¬ 
covery  rate  associated  with  Z  values.  The  permutation  was  done  with¬ 
in  each  of  the  data  sets,  but  never  across  the  data  sets.  Note  that  other 
than  the  permutation  step,  our  method  is  the  same  as  that  described 
by  Wang  et  al.  (16). 

Results 

Our  overall  study  population  numbered  99  individuals, 
composed  of  56  current  smokers,  24  former  smokers,  and  19 


never  smokers  from  three  independent  data  sets  (Table  1).  The 
MDACC-1  and  MDACC-2  data  sets  included  41  chronic  smo¬ 
kers  (26  current,  15  former)  enrolled  in  an  ongoing  chemopre- 
vention  trial  at  M.  D.  Anderson  Cancer  Center.  All  41  of  these 
subjects  had  at  least  a  20-pack-year  smoking  history.  Demo¬ 
graphic  characteristics  of  the  MDACC  cohorts  are  included 
in  Supplementary  Table  SI.  The  BMC  data  set  was  composed 
of  75  current,  former,  and  never  smokers.  Never  smokers  with 
significant  environmental  cigarette  exposure  and  subjects  with 
respiratory  symptoms  or  who  regularly  use  inhaled  medica¬ 
tions  were  excluded.  We  selected  58  members  of  the  BMC 
cohort  for  the  present  analysis  (Table  1)  and  excluded  17  sub¬ 
jects.  Exclusions  from  either  the  BMC  or  MDACC  data  sets 
were  based  on  image  quality  criteria  applied  consistently 
across  all  three  data  sets. 

First,  we  determined  Z  values  (defined  in  Materials  and 
Methods)  in  the  three  data  sets  separately.  Then,  we  compared 
the  Z  values  from  each  data  set  and,  as  shown  in  Fig.  1,  we 
found  that  the  genes  with  the  most  significant  differential  ex¬ 
pressions  (shown  in  red)  are  similar  among  the  three  data  sets. 
The  largest  Z  values  mostly  are  located  in  the  first  and  third 
quadrants  in  the  scatter  plots  of  Fig.  1,  indicating  that  these 


std  normal 


Fig.  2.  Quantile-quantile  plots  of  Z  values.  A,  quantile  of  Z  values  versus 
quantile  of  Z  values  obtained  from  permuted  data.  B,  quantile  of  Z  values  from 
permutation  data  versus  quantile  values  of  standard  normal  distribution. 
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Fig.  3.  Histogram  of  P  values  in  search  of  differential  expression  between 
current  and  former  smokers.  Based  on  BUM  estimate,  345  probe  sets  were 
identified  as  differentially  expressed  with  a  false  discovery  rate  of  32%.  One 
hundred  seventy-six  of  the  345  probe  sets  have  a  fold  change  >2.  Detailed  gene 
information  on  the  176  probe  sets  (145  genes)  is  provided  in  Supplementary 
Table  S2.  The  P  values  were  evaluated  on  the  basis  of  the  combined  Z  values 
from  the  three  data  sets  and  the  combined  Z  values  from  the  permuted  data. 

changes  in  gene  expression  are  consistent  among  the  three 
data  sets. 

To  further  assess  the  statistical  significance  of  the  changes 
in  gene  expression  between  former  and  current  smokers, 
we  used  quantile-quantile  plots  (Fig.  2)  of  Z  values  and 
BUM  plots  (Fig.  3)  to  evaluate  the  distribution  of  P  values. 
Figure  2A  compares  the  quantiles  of  Z  values  calculated  from 
combining  all  three  microarray  data  sets  and  the  quantiles  of 
Zp  (Z  values  calculated  from  permuted  data).  With  permuted 
data,  Z  values  are  bounded  between  -10  and  10.  The  Z  values 
from  observed  data  contain  clear  outliers  >10.  Without  differ¬ 
ential  expression,  the  data  points  in  Fig.  2A  should  be  close  to 
the  diagonal  line  (shown  in  red).  Ideally,  if  the  gene  expression 
data  obey  normal  distributions  and  are  independent  from 
each  other,  we  would  expect  values  of  Zp  to  form  a  standard 


normal  distribution.  However,  Fig.  2B  shows  that  Zp's  have 
wider  ranges  than  that  from  standard  normal  distribution. 
Consequently,  we  used  the  distribution  of  Zp  as  that  from 
the  null  hypothesis  (no  differential  expression  between  former 
and  current  smokers)  to  compute  the  P  values  of  Z  instead  of 
using  the  standard  normal  distribution. 

The  BUM  plot  (17)  presented  a  histogram  of  the  P  values. 
Under  the  null  hypothesis,  the  P  values  should  form  a  uniform 
distribution.  The  sharp  spike  at  the  left  side  of  Fig.  3  repre¬ 
sents  the  effects  of  differential  expression  contradicting  the 
null  hypothesis.  The  uniform  part  of  the  histogram  is  indi¬ 
cated  by  the  red  line  in  Fig.  3.  The  area  above  the  red  line  con¬ 
tains  ~  1,200  probe  sets,  which  is  our  estimated  number  of 
genes  that  are  differentially  expressed  between  the  former 
smokers  and  current  smokers.  Only  a  subset  of  these  genes 
is  identifiable,  however.  According  to  the  BUM  method  (17), 
we  found  345  probe  sets  that  were  differentially  expressed  at 
a  P  value  of  <0.01,  for  which  the  false  discovery  rate  was  es¬ 
timated  to  be  32%.  Of  the  345  probe  sets,  176  have  a  >2-fold 
difference  in  expression  (details  of  these  176  probe  sets  are 
shown  in  Supplementary  Table  S2).  These  176  probe  sets  re¬ 
present  145  nonredundant  significantly  differentially  ex¬ 
pressed  genes  (>2-fold  change;  P  <  0.01).  These  145  genes 
include  9  genes  (Table  2)  with  consistent  and  significant 
changes  in  each  of  the  three  data  sets  (P  <  0.01;  >2-fold 
change).  Eight  of  the  nine  genes  are  down-regulated  after 
smoking  cessation;  one  is  up-regulated.  To  test  the  general  ac¬ 
curacy  of  our  microarray  measurements,  we  compared  them 
with  reverse  transcription-PCR  measurements  of  a  selected 
panel  of  genes,  finding  that  the  reverse  transcription-PCR 
and  microarray  measurements  were  highly  correlated, 
96%  [e.g.,  in  the  case  of  ALDH3A1  (Supplementary  Fig.  SI), 
which  is  the  gene  with  the  largest  change  between  former 
and  current  smokers  (Table  2)].  Furthermore,  although  not  cal¬ 
culated,  the  false  discovery  rate  for  the  subset  of  176  probe  sets 


Table  2.  Genes  with  consistent  fold  changes  >2  in  each  (P  <  0.01)  and  across  (P  <  0.0001)  the  three  data  sets 


Gene 

Fold  changes 

BMC  MDACC-1 MDACC-1  Comb. 

P  (comb.) 

Full  name 

RefSeq 

Probe  set 

ALDH3A1 

6.9 

9.4 

4.0 

6.2 

0.0000 

aldehyde  dehydrogenase  3  family, 
member  A1 

NM_000691 

205623_at 

CYP1B1 

4.2 

5.7 

6.7 

4.9 

0.0000 

cytochrome  P450,  member  1 B1 

NM_000104 

202436_s_at 

MUC5AC 

2.2 

9.6 

3.0 

3.5 

0.0000 

mucin  5AC,  oligomeric 
mucus/gel-forming 

XM_001 130382 

214385_s_at 

AKR1C2 

3.3 

4.2 

3.5 

3.5 

0.0000 

aldo-keto  reductase  family  1 , 
member  C2 

NM_001354 

209699_x_at 

AKR1B10 

3.2 

4.2 

3.8 

3.5 

0.0000 

aldo-keto  reductase  family  1 , 
member  B10 

NM_020299 

206561 _s_at 

AKR1C1 

2.8 

4.0 

3.3 

3.2 

0.0000 

aldo-keto  reductase  family  1, 
member  Cl 

NM_001353 

204151_x_at 

NQOI 

2.8 

2.3 

2.4 

2.6 

0.0001 

NAD(P)H  dehydrogenase,  quinone  1 

NM_000903 

210519_s_at 

AKR1C3 

2.5 

2.1 

3.1 

2.5 

0.0000 

aldo-keto  reductase  family  1, 
member  C3 

NM_003739 

209160_at 

SCGB1A1 

-2.0 

-2.4 

-2.6 

-2.4 

0.0001 

secretoglobin,  family  1A, 
member  1  (uteroglobin) 

NM_003357 

205725_at 
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Fig.  4.  Histograms  of  P  values  in  search  of  differential  expression  between 
never  smokers  and  current  smokers  (4)  and  between  former  smokers  and  never 
smokers  (8).  Only  data  from  BMC  data  set  were  used  the  plots.  False  discovery 
rates  were  estimated  to  be  5%  and  16%  for  P  <  0.01  in  A  and  B,  respectively. 

should  be  lower  than  that  (32%)  estimated  for  the  345  probe 
sets,  and  the  false  discovery  rate  for  the  nine  changed  genes 
that  were  validated  across  three  data  sets  should  be  lower  still 
because  each  subset  adds  new  criteria  that  increase  reliability 

For  comparison,  we  also  examined  differential  gene  expres¬ 
sion  between  current  and  never  smokers  (Fig.  4A).  Similar  to 
that  in  Fig.  3,  the  peak  volume  above  the  red  line  represents 
the  number  of  differentially  expressed  genes,  which  is  ~  11,000 
probe  sets.  This  number  is  >9  times  greater  than  the  number 
detected  in  the  comparison  between  former  and  current  smo¬ 
kers  (Fig.  3).  We  found  591  nonredundant  genes  with  statisti¬ 
cally  significant  changes  (>2-fold  change  and  P  <  0.01)  in 
pooled  data  of  the  three  data  sets,  a  group  that  is  >4  times 
larger  than  the  group  of  such  differentially  expressed  genes 
detected  in  the  comparison  between  current  and  former  smo¬ 
kers.  Of  the  145  significantly  changed  genes  between  current 
and  former  smokers,  77  are  consistent  with,  and  68  are  not 
consistent  with,  the  591  such  genes  between  current  and  never 
smokers  (Supplementary  Table  S2).  The  nine  genes  with  con¬ 
sistent  and  significant  changes  between  former  and  current 
smokers  in  each  of  the  three  data  sets  are  in  the  subset  of  77 
common,  significantly  changed  genes.  Figure  4B  compares 
former  smokers  with  never  smokers. 

The  scope  of  differential  expressions  in  Fig.  4A  is  much  lar¬ 
ger  than  that  in  Fig.  3,  which  may  be  due  to  differences  in 
sample  size.  A  principal  component  analysis  (Fig.  5),  however, 
supported  the  conclusion  that  the  larger  differential  expres¬ 
sion  in  Fig.  4A  compared  with  that  in  Fig.  3  is  not  simply 
due  to  sample  size.  The  gene  expression  profile  of  each  patient 
is  represented  by  its  two  main  principal  components.  Two  dis¬ 


tinct  clusters  emerge  in  Fig.  5,  and  the  cluster  to  the  left 
(Compl  < — 10)  contains  mostly  never  smokers.  The  right-side 
cluster  is  predominated  by  a  mixture  of  current  and  former 
smokers,  which  supports  the  conclusion  that  former  smokers 
are  more  similar  to  current  smokers  than  to  never  smokers. 

Discussion 

In  probing  13,902  genes,  we  found  that  591  were  differen¬ 
tially  expressed  in  current  versus  never  smokers  and  that  only 
145  of  these  591  (25%)  were  also  differentially  expressed  in 
current  versus  former  smokers.  Among  these  145  genes,  9 
were  significantly  differentially  expressed  (8  overexpressed, 
1  underexpressed;  Table  2)  by  >2-fold  in  current  versus  former 
smokers  in  each  (P  <  0.01)  and  in  the  pooled  data  (P  <  0.0001) 
of  the  three  data  sets  (two  MDACC,  one  BMC)  included  in  this 
study.  Therefore,  our  present  study  pinpoints  and  validates 
nine  differentially  expressed  genes  in  former  versus  current 
smokers. 

Seven  of  the  eight  validated  genes  overexpressed  in  cu¬ 
rrent  smokers — CYP1B1,  four  AKRs,  ALDH3A1,  and  NQOl 
(Table  2) — are  involved  in  drug  and/ or  carcinogen  metabolism 
(9,  10,  18-27).  Polycyclic  aromatic  hydrocarbons  in  tobacco 
smoke  are  known  to  bind  to  and  activate  the  aryl  hydrocar¬ 
bon  receptor  and  thus  induce  CYP1B1  (10).  CYP1B1  expres¬ 
sion  is  of  special  interest  because  it  may  contribute  both  to 
increased  drug  metabolism  and  to  carcinogenesis  of  the  aero- 
digestive  tract  (1,  18-20).  The  metabolic  clearance  of  docetax- 
el,  tamoxifen,  gefitinib,  erlotinib,  and  other  cancer  prevention 
and  therapy  drugs  is  enhanced  by  YP1B1  (9,  21-23).  Up- 
regulation  of  CYP1B1  and  the  six  other  validated  overex¬ 
pressed  metabolizing  genes  by  smoking  is  likely  involved 
in  the  adverse  interactions  between  smoking  and  drugs  for 
lung  cancer  prevention  and  therapy;  smoking  cessation 
down-regulates  these  gene  expressions  and  thus  may  reduce 
or  eliminate  the  adverse  drug  interactions. 

Four  of  the  eight  most-up-regulated  genes  we  detected 
in  current  smokers  (Table  2)  are  members  of  the  AKR 


Fig.  5.  Principal  component  analysis.  The  two  main  principal  components  were 
used  to  visualize  the  relationships  among  patients  with  different  smoking 
status.  Each  point  represents  a  patient.  Current  smokers  were  shown  in  black, 
former  smokers  in  red,  and  never  smokers  in  blue.  BMC  data  were  shown  in 
circles,  MDACC-1  data  in  pluses,  and  MDACC-2  data  in  triangles. 
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family  (AKR1C1,  AKR1C2,  AKR1C3,  and  AKR1B10;  ref.  28). 
AKR1B10  is  overexpressed  in  non-small-cell  lung  cancer  and 
squamous  metaplasia  in  association  with  smoking  (24,  26). 
AKR1C1,  AKR1C2,  and  AKR1C3  are  known  to  be  involved 
in  tobacco  carcinogen  and/or  drug  metabolism.  AKR1C1 
overexpression  is  correlated  with  a  poor  prognosis  of  non¬ 
small-cell  lung  cancer  and  is  associated  with  chemotherapeu¬ 
tic  drug  resistance  (25,  29).  Data  suggest  that  a  potential  role  of 
AKR1B10  in  retinoic  acid  signaling  (30)  may  be  a  factor  in 
the  negative  effects  of  retinoic  acid  (retinoids)  and  its  relative 
|3 -carotene  in  smokers  in  chemoprevention  trials  (3-7).  Several 
studies  have  shown  that  overexpression  of  AKR1C1,  AKR1C2, 
or  AKR1C3  contributes  to  the  resistance  of  various  tumor 
types,  including  lung  cancer,  to  cisplatin-based  chemotherapy 
(25,  31-33). 

Various  biases  can  produce  inconsistencies  between  similar 
data  sets.  These  biases  can  stem  from  differences  in  age,  race, 
sex,  smoking  history,  and  sample  processing.  Regarding  sam¬ 
ple  processing,  for  example,  MD ACC-1  and  MD ACC-2  sam¬ 
ples  involved  two  rounds  of  RNA  amplification  versus  a 
single  round  in  the  BMC  set.  Two  rounds  of  amplification  are 
known  to  cause  loss  of  signals  for  probes  that  target  far  away 
from  the  3'  end  of  mRNA  sequences.  The  consistent  changes  in 
smoking  cessation-related  genes  in  all  three  independent  data 
sets  support  the  robustness  of  our  present  findings. 

Gene  expression  profiling  in  bronchoscopy  specimens  offers 
a  direct  assessment  of  the  effects  of  cigarette  smoking  in  the 
lungs.  Gene  expression  patterns  vary  greatly  between  indivi¬ 
duals,  however,  because  of  genetic  variations  and  different  en¬ 
vironmental  influences.  A  report  by  Spira  et  al.  (on  a  relatively 
broad  array  of  differentially  expressed  metabolizing  and  anti¬ 
oxidant  genes  in  current  versus  former  smokers;  ref.  12)  pro¬ 
vided  us  with  the  opportunity  to  increase  the  robustness  of 
our  gene  expression  analyses  by  adding  the  BMC  data  set  to 
our  MDACC  data  sets.  As  we  prepared  our  present  results  for 
publication,  the  Spira  group  published  another  report  (34)  that 
extended  their  earlier  study,  as  do  the  complementary  and 


confirmatory  findings  we  report  here.  We  were  able  to  identify 
the  specific  drug-metabolizing  genes  involved  in  smoking- 
drug  interactions,  including  the  overexpressed  genes  in  cur¬ 
rent  versus  former  smokers,  because  of  the  cross  validation 
and  increased  statistical  power  provided  by  adding  the 
BMC  data  set  (12)  to  our  MD  ACC-1  and  MD  ACC-2  data  sets. 
The  combined  effect  of  these  reports  is  to  increase  the  robust¬ 
ness  of  their  interrelated  findings  and  thus  their  appeal  for  hy¬ 
pothesis  generation. 

Our  results  also  show  that  the  scope  of  genetic  changes  fol¬ 
lowing  smoking  cessation  is  much  smaller  than  that  associated 
with  chronic  smoking  (Figs.  2  and  4),  possibly  explaining  the 
persistent  high  lung  cancer  risk  in  former  smokers  (35).  Sur¬ 
prisingly  at  the  time  (~10  years  ago),  we  and  others  previously 
found  in  assessments  limited  to  specific  genetic  alterations  that 
smoking-related  genetic  changes  persisted  after  smoking  ces¬ 
sation  in  a  population  similar  to  those  of  MDACC-1, 
MD  ACC-2,  and  BMC  (36,  37).  Showing  similar  genetic  altera¬ 
tions  in  current  and  former  smokers,  results  of  the  more  sophis¬ 
ticated  global  genomic  profiling  approach  of  our  present  and 
other  studies  are  consistent  with  the  earlier  findings  (12,  34). 

Our  findings  underscore  the  importance  of  smoking  status 
in  clinical  trials,  showing  that  smoking  effects  on  metabolizing 
genes  potentially  can  interfere  with  drugs  in  standard  or  in¬ 
vestigational  chemoprevention  or  therapy  not  only  in  the  lung 
but  in  other  sites  as  well.  Future  research  directions  should 
include  ( a )  increased  monitoring  of  smoking  status  and  in¬ 
creased  smoking  cessation  efforts  in  any  trial  setting  because 
of  adverse  smoking  effects  on  drug  uptake  and  metabolism, 
and  ( b )  the  development  of  new  dosing  and  targeted  ap¬ 
proaches  to  counteract  adverse  smoking-drug  interactions  in 
the  lung.  New  targeted  approaches  should  consider  the  sig¬ 
naling  pathways  of  drug-metabolizing  genes  that  were  vali¬ 
dated  in  this  study. 
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Differential  roles  of  DR4,  DR5  and  c-FLIP  in  regulation  of  geranylgeranyltransferase  I 
inhibitor-induced  augmentation  of  tumor  necrosis  factor-related  apoptosis-inducing 
ligand-induced  apoptosis.  Shuzhen  Chen,1  Lei  Fu.1  Shruti  M.  Raja,1  Ping  Yue,1  Yuri  K. 
Peterson,2  Fadlo  R.  Khuri,1  and  Shi-Yong  Sun.1  department  of  Hematology  and  Medical 
Oncology,  Winship  Cancer  Institute,  Emory  University  School  of  Medicine,  Atlanta,  Georgia, 
and  2 Department  of  Pharmacology  and  Cancer  Biology,  Duke  University  Medical  Center, 
Durham,  North  Carolina 

Geranylgeranyltransferase  I  (GGTase  I)  has  emerged  as  a  cancer  therapeutic  target. 
Accordingly,  small  molecules  that  inhibit  GGTase  I  have  been  developed  and  exhibit 
encouraging  anticancer  activity  both  in  vitro  and  in  vivo  in  preclinical  studies.  However,  the 
underlying  anticancer  mechanisms  of  GGTase  I  inhibitors  remain  unclear.  Here  we  have 
demonstrated  a  novel  mechanism  by  which  GGTase  I  inhibition  modulates  apoptosis. 
Inhibition  of  GGTase  I  by  GGTI-298  induced  apoptosis  and  augmented  tumor  necrosis  factor- 
related  apoptosis-inducing  ligand  (TRAIL)-induced  apoptosis  in  human  lung  cancer  cells. 
GGTI-298  induced  the  expression  of  both  DR4  and  DR5,  two  cell  surface  death  receptors  for 
TRAIL,  and  downregulated  the  expression  of  c-FLIP,  a  key  inhibitor  of  death  receptor-induced 
apoptosis.  Consistently,  another  highly  selective  GGTase  I  inhibitor,  GGTI-DU40,  but  not  its 
inactive  analog  SN-DU40,  exerted  similar  effects.  Enforced  expression  of  c-FLIP  or 
knockdown  of  DR5  expression  protected  cells  from  induction  of  apoptosis  by  the  combination 
of  GGTI-298  and  TRAIL,  indicating  that  induction  of  DR5  and  downregulation  of  c-FLIP 
mediate  augmentation  of  TRAIL-induced  apoptosis  by  GGTase  I  inhibition.  Surprisingly, 
blockade  of  DR4  induction  by  knocking  down  DR4  expression  sensitized  cancer  cells  to 
GGTI298/TRAIL-induced  apoptosis,  suggesting  that  DR4  induction  may  play  an  opposite  role 
to  DR5  induction  in  regulating  GGTI298/TRAIL-induced  apoptosis.  The  combination  of 
GGTI-298  and  TRAIL  was  more  effective  than  each  single  agent  in  decreasing  the  levels  of 
IicBa  and  p-Akt,  implying  that  GGTI298/TRAIL  activates  NF-kB  and  inhibits  Akt. 
Interestingly,  knockdown  of  DR5,  but  not  DR4,  prevented  GGTI298/TRAIL-induced  IicBa 
and  p-Akt  reduction,  suggesting  that  DR5  mediates  reduction  of  IicBa  and  p-Akt  induced  by 
GGTI298/TRAIL.  In  contrast,  DR4  knockdown  further  facilitated  IicBa  and  p-Akt  reduction 
by  GGTI298/TRAIL,  suggesting  that  DR4  also  plays  an  opposite  role  to  DR5  in  regulation  of 
GGTI/TRAIL-induced  apoptotic  signaling.  To  our  knowledge,  this  is  the  first  demonstration 
that  DR4  and  DR5  may  play  differential  roles  in  regulation  of  death  receptor-mediated 
apoptosis.  Collectively,  we  conclude  that  inhibition  of  GGTase  I  with  GGTI  inhibitors  induces 
DR5  and  downregulates  c-FLIP,  leading  to  augmentation  of  TRAIL-induced  apoptosis.  Thus, 
inhibition  of  GGTase  I  can  be  a  novel  strategy  for  enhancing  TRAIL-based  cancer  therapy. 
(Supported  by  the  Georgia  Cancer  Coalition  Distinguished  Cancer  Scholar  award,  Department 
of  Defense  grant  W81XWH-04- 1-0 142- VITAL,  and  NIH/NCI  SPORE  P50  grant  CA128613- 
01;  SY  Sun,  and  FR  Khuri  are  Georgia  Cancer  Coalition  Distinguished  Cancer  Scholars) 


The  eIF4E/eIF4G  interaction  inhibitor  4EGI-1  augments  TRAIL-induced  apoptosis 
through  DR5  induction  and  c-FLIP  downregulation  independent  of  inhibition  of  cap- 
dependent  protein  translation.  Songqing  Fan.  Yikun  Li.  Ping  Yue,  Fadlo  R.  Khuri,  and  Shi- 
Yong  Sun.  Departments  of  Hematology  and  Medical  Oncology,  Winship  Cancer  Institute, 
Emory  University  School  of  Medicine,  Atlanta,  Georgia 

Cap-dependent  protein  translation  plays  an  important  role  in  regulation  of  oncogenesis 
primarily  through  regulation  of  the  expression  of  certain  oncogenic  proteins  (e.g.,  cyclin  D1 
and  HIFla,).  Thus  inhibition  of  cap-dependent  protein  translation  has  emerged  as  an  attractive 
therapeutic  strategy.  The  small  molecule  4EGI-1  was  identified  as  an  inhibitor  of  cap- 
dependent  translation  initiation  by  disrupting  eIF4E/eIF4G  association  through  binding  to 
eIF4E  and  exhibits  growth-inhibitory  and  apoptosis-inducing  activity  in  cancer  cells.  We  were 
interested  in  its  therapeutic  effects  in  human  lung  cancer  cells.  4EGI-1  as  a  single  agent 
inhibited  the  growth  and  induced  apoptosis  of  human  lung  cancer  cells.  When  combined  with 
the  death  ligand  tumor  necrosis  factor-related  apoptosis-inducing  ligand  (TRAIL),  enhanced 
apoptosis-induced  activity  was  observed.  In  lung  cancer  cells,  4EGI-1  inhibited  eIF4E/eIF4G 
interaction,  reduced  the  levels  of  cyclin  D1  and  HIFla,  both  of  which  are  regulated  by  a  cap- 
dependent  translation  mechanism.  Moreover,  4EGI-1  upregulated  DR5  expression  and 
downregulated  c-FLIP  levels.  Small  interfering  RNA  (siRNA)-mediated  blockade  of  DR5 
induction  or  enforced  expression  of  c-FLIP  abrogated  4EGI-l’s  ability  to  enhance  TRAIL- 
induced  apoptosis,  indicating  that  both  DR5  induction  and  c-FLIP  downregulation  contribute 
to  enhancement  of  TRAIL-induced  apoptosis  by  4EGI-1.  However,  inhibition  of  eIF4E/eIF4G 
interaction  by  eIF4E  siRNA-mediated  knockdown  of  eIF4E  effectively  reduced  the  levels  of 
cyclin  D1  and  HIFla,  but  failed  to  induce  DR5  expression,  downregulate  c-FLIP  levels,  and 
augment  TRAIL-induced  apoptosis.  As  well,  the  mTOR  inhibitor  rapamycin,  which  inhibits 
cap-dependent  translation  initiation,  did  not  enhance  TRAIL-induced  apoptosis.  Collectively, 
we  conclude  that  4EGI- 1  augments  TRAIL-induced  apoptosis  through  induction  of  DR5  and 
downregulation  of  c-FLIP  independent  of  inhibition  of  cap-dependent  protein  translation.  (S. 
Fan.  and  Y.  Li  share  first  authorship;  this  work  was  supported  by  the  Georgia  Cancer  Coalition 
Distinguished  Cancer  Scholar  award,  DOD  grant  W81XWH-04- 1-0 142- VITAL,  NIH  ROl 
CA1 18450-01  and  NIH  SPORE  P50  grant  CA128613-01;  SY  Sun,  and  FR  Khuri  are  Georgia 
Cancer  Coalition  Distinguished  Cancer  Scholars) 
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Running  title:  IGF  expression  and  lung  carcinogenesis 


Abstract 

Insulin-like  growth  factor  1  receptor  (IGF-1R)  signaling  has  been  implicated  in  several 
human  neoplasms.  However,  the  role  of  serum  levels  of  insulin-like  growth  factors  (IGFs) 
in  lung  cancer  risk  is  controversial.  Our  study  assessed  the  role  of  tissue-derived  IGFs  in 
lung  carcinogenesis.  We  found  that  IGF-1  and  IGF -2  levels  in  bronchial  tissue  specimens 
containing  high-grade  dysplasia  were  significantly  higher  than  in  those  containing  normal 
epithelium,  hyperplasia,  and  squamous  metaplasia.  Derivatives  of  human  bronchial 
epithelial  cell  lines  with  activation  mutation  in  KRAS  (VI 2)  or  loss  of  p53,  genetic 
changes  frequently  observed  during  lung  carcinogenesis,  overexpressed  IGF-1  and  IGF-2. 
Tobacco  carcinogen  (TC)  4-(methylnitrosamino)-l-(3-pyridyl)-l-butanone  (NNK) 
enhanced  transformed  characteristics  of  these  cells,  which  were  significantly  suppressed 
by  the  inhibiting  the  action  of  IGF-1,  IGF-2,  or  the  insulin-like  growth  factor  receptor 
(IGFR).  We  further  determined  the  role  of  IGF  expression  in  lung  tumorigenesis  using  a 
mouse  model  with  a  lung-specific  IGF-1  transgene  after  exposure  to  TCs,  including 
urethane  or  NNK  plus  benzo[a]pyrene  (BaP).  Finally,  we  demonstrated  antitumor 
activities  of  the  selective  IGF-1R  tyrosine  kinase  inhibitor  c/.v-3-[3-(4-methyl-piperazin-l- 
yl)-cyclobutyl]-l-(2-phenyl-quinolin-7-yl)  -imidazo[l,5-a]pyrazin-8-ylamine  (PQIP)  in 
IGF-1  transgenic  mice  carrying  NNK/BaP-induced  lung  tumors.  Our  results  demonstrate 
that  airway  epithelial  cells  produce  IGFs  in  an  autocrine  manner,  and  these  IGFs  act 
jointly  with  TCs  to  stimulate  lung  carcinogenesis.  Thus,  the  use  of  selective  IGF-1R 
inhibitors  may  be  a  rational  approach  to  controlling  lung  cancer. 
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Department  of  Thoracic/Head  and  Neck  Medical  Oncology,  the  University  of  Texas  M.  D. 
Anderson  Cancer  Center,  1515  Holcombe  Boulevard,  Houston,  TX  77030-4009. 

Lung  cancer  has  been  reported  the  highest  cancer  mortality  in  both  sexes  worldwide. 
Early  detection  and/or  development  of  clinically  efficient  novel  preventive/therapeutic  targets  for 
lung  cancer  are  urgently  required  to  reduce  the  high  mortality  rate  associated  with  this  disease. 
During  the  development  of  squamous  cell  carcinoma  (SCC)  in  the  lung,  bronchial  epithelial  cells 
exhibit  a  progressive  series  of  morphologically  distinct  changes:  hyperplasia,  squamous 
metaplasia,  dysplasia,  carcinoma  in  situ,  and  finally  invasive  SCC.  Here,  we  investigated 
molecular  mechanisms  involved  in  hyperplasia  of  bronchial  epithelial  cells.  We  demonstrated 
that  ErbBl  ligands,  including  epidermal  growth  factor  (EGF),  TGF-alpha,  and  amphiregulin, 
completely  disrupted  apical-basal  polarity  and  induces  hyperplasia  of  normal  human 
tracheobronchial  epithelial  (NHTBE)  cells.  EGF-induced  hyperplasia  was  completely  blocked 
by  EGFR  inhibitor,  erlotinib,  and  MEK1/2  inhibitor,  U0126,  suggesting  involvement  of  MEK- 
ERK  signaling.  Further  studies  showed  that  EGF  substantially  upregulated  cyclin  D1  and  these 
inhibitors  completely  blocked  the  upregulation.  Promoter  analysis  of  cyclin  D1  revealed  that  Ap- 
1  transcription  factor  regulates  the  overexpression  of  cyclin  Dl.  Depletion  of  AP-1  component  c- 
Jun  using  siRNA  completely  abrogated  EGF-induced  cyclin  Dl  expression  and  also  inhibited 
EGF-induced  hyperplasia  in  NHTBE  cells.  In  conclusion,  we  showed  that  EGF  induced 
hyperplasia  of  primary  bronchial  epithelial  cells  and  AP-1  plays  a  crucial  role. 
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Despite  recent  advances,  the  prognosis  of  non-small-cell  lung  cancer  (NSCLC)  is  still 
dismal.  The  recurrence  rate  of  early-stage  NSCLC  is  -40%  within  five  years  after  a 
potentially  curative  treatment.  Because  of  the  limited  prognostic  power  of  the  current 
pathologic  and  clinical  criteria,  providing  accurate  molecular  prediction  of  the  clinical 
outcome  is  currently  needed.  Recently,  using  cDNA  microarray  and  RT-PCR  methods 
it  has  been  reported  that  the  expression  of  genes  HER3,  LCK  (lymphocyte-specific 
protein  tyrosine  kinase),  DUSP6  (dual-specificity  phosphatase  6),  and  STAT1  (signal 
transducer  and  activator  transduction  1)  closely  associate  with  recurrence-free  and 
overall  survival  among  NSCLC  patients  (Chen  et  al,  New  Eng  J  Med  2007:  356:11-20). 
As  only  HER3  protein  expression  has  been  previously  reported  in  lung  cancer,  we 
investigated  the  immunohistochemical  (IHC)  expression  of  those  four  genes’  proteins 
in  a  large  set  of  NSCLC  tissue  microarray  specimens  (N=306;  194  adenocarcinomas 
and  112  squamous  cell  carcinomas),  and  correlate  their  expression  with 
clinico-pathologic  features,  including  prognosis.  Protein  expression  was  examined 
semi-quantitatively  using  both  intensity  and  extension  of  staining,  and  a  final  score 
was  calculated  for  each  marker.  HER3  and  DUSP6  proteins  were  expressed  only  in 
the  cytoplasm  of  tumor  cells  (38%  and  88%  of  the  tumors,  respectively),  STAT-1 
expressed  in  both  tumor  and  stromal  cell  compartments  (55%  of  tumors  in  each 
compartment),  and  LCK  expressed  only  in  inflammatory  stromal  cells  (85%  of  tumors). 
None  of  the  markers  expression  correlated  with  patients’  recurrence-free  and  overall 
survival.  STAT1  expression  was  lost  in  45%  of  NSCLCs  and  was  significantly  lower  in 
patients  with  squamous  cell  carcinoma  compared  to  adenocarcinoma  (P=0.010), 
non-smoking  history  compared  to  smokers  (P=0.001),  and  in  more  advanced  TNM 
stages  (P=0.015).  Our  findings  point  out  the  difficulties  of  validation  gene  expression 
data  using  protein  expression  analysis  and  that  tissue-based  in  situ  methodologies 
are  important  to  identify  the  type  of  cells  expressing  specific  molecular  markers.  We 
have  confirmed  at  protein  level  that  STAT 1  is  frequently  lost  in  NSCLC  tumor  tissues 
and  the  pattern  of  immunostaining  in  tumor  cells  is  compatible  with  tumor  suppressor 
gene  activity  and  may  represent  a  novel  tumor  suppressor  gene  for  this  neoplasm 
(Supported  by  Grant  DoD-W81XWH-04-1-0142  and  W81XWH-05-2-0027). 
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ABSTRACT 


Purpose:  To  determine  the  frequency  of  estrogen  receptors  (ER)  a  and  p  and  progesterone 
receptor  (PR)  protein  immunohistochemical  expression  in  a  large  set  of  NSCLC  specimens,  and 
to  compare  our  results  with  those  for  some  of  the  same  antibodies  that  have  provided 
inconsistent  results  in  previously  published  reports. 

Experimental  Design:  Using  multiple  antibodies,  we  investigated  the  immunohistochemical 
expression  of  ERa  and  p  and  PR  in  317  NSCLCs  placed  in  tissue  microarrays  and  correlated 
their  expression  with  patients’  clinicopatho logic  characteristics,  and  in  adenocarcinomas,  with 
EGFR  mutation  status. 

Results:  ERa  and  p  were  detected  in  the  nucleus  and  cytoplasm  of  NSCLC  cells;  however,  the 
frequency  of  expression  (nucleus:  a,  5%-36%,  and  P,  42%-56%;  cytoplasm:  a,  <  1  %— 42%,  and 
P,  20%-98%)  varied  among  the  different  antibodies  tested.  PR  was  expressed  in  the  nuclei  of 
malignant  cells  in  63%  of  the  tumors.  ERa  nuclear  expression  significantly  correlated  with 
adenocarcinoma  histology,  female  gender,  and  history  of  never  smoking  (P  =  0.0048  to 
<0.0001).  In  NSCLC,  higher  cytoplasmic  ERa  expression  significantly  correlated  with  worse 
RFS  (HR  1.77,  95%  Cl,  1.12,  2.82;  P  =  0.015)  in  multivariate  analysis.  In  adenocarcinomas, 
ERa  expression  correlated  with  EGFR  mutation  (P  =  0.0029  to  <0.0001).  ERp  and  PR,  but  not 
ERa,  expressed  in  the  normal  epithelium  adjacent  to  lung  adenocarcinomas. 

Conclusions:  ERa  and  P  and  PR  are  frequently  expressed  in  NSCLC.  ERa  expression 
distinguishes  a  subset  of  NSCLC  that  has  defined  clinicopathologic  and  genetic  features.  The 
correlation  between  ER  and  EGFR  mutation  in  lung  adenocarcinoma  suggests  that  it  might  be 
important  to  target  both  pathways  simultaneously  in  lung  cancer  therapy. 
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INTRODUCTION 


Lung  cancer  is  the  most  common  cause  of  cancer  mortality  worldwide,  with  over  1 
million  deaths  each  year  (1).  Lung  cancer  includes  several  histological  types,  the  most  frequently 
occurring  of  which  are  two  types  of  non-small  cell  lung  carcinoma  (NSCLC):  adenocarcinoma 
and  squamous  cell  carcinoma  (2).  During  the  last  two  decades,  mortality  rates  associated  with 
cancer  have  continued  to  decrease  across  all  major  sites  in  both  men  and  women;  however,  the 
rates  for  lung  cancer  in  females  have  continued  to  increase  (3,  4).  Despite  global  statistics 
estimating  that  15%  of  lung  cancer  in  men  and  53%  in  women  are  not  attributable  to  smoking 
(1),  smoking  remains  the  primary  risk  factor  for  lung  cancer.  The  higher  proportion  of  lung 
cancer  in  females  who  have  never  smoked  compared  with  males  who  have  never  smoked 
suggests  a  possible  role  for  gender-dependent  hormones  in  the  development  of  lung  cancer  (5). 

Estrogen  receptors  (ER)  a  and  (1  are  expressed  in  normal  lung  tissue  and  in  lung  tumors 
in  both  men  and  women  (6),  yet  the  data  are  inconsistent  as  to  whether  ER  expression  is  gender 
biased  (6-9)  or  associated  with  NSCLC  overall  survival  (9-1 1).  The  data  reported  on  the 
immunohistochemical  expression  for  both  ER  receptors  in  NSCLC  remain  controversial.  ERa 
has  been  reported  to  be  expressed  in  the  nucleus  (0%-45%)  and  cytoplasm  (0%-73%)  of 
malignant  lung  cancer  cells  in  the  cases  examined  (9,  10,  12,  13).  The  percentages  for  ERfl  are 
more  consistent,  with  46%-60%  of  NSCLC  cases  showing  only  nuclear  expression  (9-14). 
Similarly,  two  reports  suggested  that  progesterone  receptor  (PR)  is  frequently  (47%)  expressed 
in  NSCLC  tumor  cells,  and  this  expression  correlated  with  better  patient  outcome  (12,  15). 

Several  in  vitro  and  in  vivo  studies  have  provided  evidence  supporting  a  biological  role 
for  estrogens  in  lung  carcinogenesis  by  direct  promotion  of  cell  proliferation — estrogens 
stimulate  the  proliferation  of  NSCLC  cells  through  estrogen  receptor-mediated  signaling, 
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whereas  anti-estrogens  inhibit  the  growth  of  NSCLC  cells  (6,1, 13,  16,  17).  Estrogen  can 
directly  stimulate  the  transcription  of  estrogen-responsive  genes  in  the  nucleus  of  lung  cells  and 
can  also  transactivate  growth  factor- signaling  pathways — the  epidermal  growth  factor  receptor 
(EGFR)  pathway,  in  particular  (13,  18).  In  estrogen  stimulation  of  lung  cancer  cells,  EGFR 
ligands  are  rapidly  released,  activating  the  EGFR  and  mitogen-activated  protein  kinase  1 
(MAPK1)  growth  pathways  (19).  Activation  of  the  EGFR  pathway  appears  to  play  an  important 
role  in  the  pathogenesis  and  progression  of  NSCLC  (20).  In  lung  cancer  cells,  the  constitutive 
activation  of  EGFR  is  achieved  by  several  mechanisms,  including  increased  production  of 
ligands,  increased  levels  of  the  receptor,  and  mutation  of  the  EGFR  tyrosine  kinase  domain  (20- 
22).  Of  interest,  EGFR  protein  expression  is  downregulated  in  response  to  estrogens  and 
upregulated  in  response  to  anti-estrogens,  suggesting  that  a  reciprocal  control  mechanism  exists 
between  the  EGFR  and  ER  pathways  (19). 

The  purpose  of  the  current  study  was  to  determine  the  frequency  of  ERa  and  p  and  PR 
protein  immunohistochemical  expression  in  a  large  set  of  NSCLCsplaced  in  tissue  microarray 
(TMA)  specimens  and  to  compare  our  results  with  those  for  some  of  the  same  antibodies  that 
have  provided  inconsistent  results  in  previously  published  reports  (9-14).  In  addition,  the 
receptor-expression  results  were  correlated  with  patients’  clinicopathologic  features,  including 
NSCLC  histology,  gender,  smoking  history,  and  patient  outcome,  and  in  adenocarcinoma  with 
tumors’  EGFR  activating  mutation  status.  Finally,  to  understand  ERa  and  P  and  PR  protein 
expression  role  in  the  early  pathogenesis  of  lung  cancer,  we  investigated  the  characteristics  of 
ERa  and  p  and  PR  protein  expression  in  the  non-malignant  respiratory  epithelium  adjacent  to 
tumors  taken  from  a  subset  of  our  retrospectively  reviewed  lung  adenocarcinoma  cases. 
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MATERIALS  AND  METHODS 


Case  selection  and  TMA  construction.  We  obtained  archived,  formalin- fixed,  paraffin- 
embedded  (FFPE)  tissue  from  surgically  resected  (with  curative  intent)  lung  cancer  specimens 
(lobectomies  and  pneumonectomies)  containing  tumor  and  adjacent  normal  epithelium  tissues 
from  the  Lung  Cancer  Specialized  Program  of  Research  Excellence  Tissue  Bank  at  The 
University  of  Texas  M.  D.  Anderson  Cancer  Center  (Houston,  TX),  which  has  been  approved  by 
the  institutional  review  board.  The  tissue  had  been  collected  from  1997  to  2001,  and  the  tissue 
specimens  were  histologically  examined  and  classified  using  the  2004  World  Health 
Organization  classification  system  (2).  We  selected  317  NSCLC  tissue  samples  (201 
adenocarcinomas  and  116  squamous  cell  carcinomas)  for  our  TMAs.  TMAs  were  constructed 
using  triplicate  1-mm  diameter  cores  per  tumor,  and  each  core  included  central,  intermediate,  and 
peripheral  tumor  tissue.  Detailed  clinical  and  pathologic  information,  including  demographics, 
smoking  history  (never-  and  ever-smokers),  and  smoking  status  (never,  former,  and  current), 
clinical  and  pathologic  tumor-node-metastasis  (TNM)  stage,  overall  survival  (OS)  duration,  and 
time  to  recurrence  was  available  for  most  cases  (Supplementary  Table  1).  Patients  who  had 
smoked  at  least  100  cigarettes  in  their  lifetime  were  defined  as  smokers,  and  smokers  who  quit 
smoking  at  least  12  months  before  their  lung  cancer  diagnosis  were  defined  as  former  smokers. 
Tumors  were  pathologic  TNM  stages  I-IV  according  to  the  revised  International  System  for 
Staging  Lung  Cancer  (23). 

To  assess  the  immunohistochemical  expression  of  ERa  and  p  and  PR  markers  in  the  non- 
malignant  respiratory  epithelium  adjacent  to  lung  tumors,  we  selected  whole  histology  sections 
containing  tumor  and  adjacent  lung  tissue  from  64  adenocarcinomas  that  were  included  in  our 
TMAs. 
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Immunohistochemical  staining  and  evaluation.  The  following  antibodies  against  ERa 
and  p  and  PR  were  purchased:  1)  ERa-1,  clone  6F1 1,  Novocastra,  Leica  Microsystems  Inc. 
(Bannockburn,  IL);  2)  ERa-2,  clone  6F11,  Chemicon,  Millipore  Corporate  (Billerica,  MA);  3) 
ERa-3,  clone  HC20,  Santa  Cruz  Biotechnology  Inc.  (Santa  Cruz,  CA);  4)  ERa-4,  clone  1D5, 
Lab  Vision  Corporation  (Fremont,  CA);  5)  ERp-1,  clone  HI 50,  Santa  Cruz  Biotechnology 
(Santa  Cruz,  CA);  2)  ERp-2,  clone  14C8,  GeneTex  Inc  (San  Antonio,  TX);  and  6)  PR,  clone 
SP2,  Lab  Vision  Corporation  (Fremont,  CA).  Details  on  immunohistochemistry  conditions  and 
characteristics  of  the  antibodies  are  listed  in  Supplementary  Table  2.  Immunohistochemical 
staining  was  performed  as  follows:  5-pM  FFPE  tissue  sections  were  deparaffinized,  hydrated, 
heated  in  a  steamer  for  10  minutes  with  10  mM  sodium  citrate  (pH  6.0)  for  antigen  retrieval,  and 
washed  in  Tris  buffer.  Peroxide  blocking  was  done  with  3%  H2O2  in  methanol  at  room 
temperature  for  15  min,  followed  by  10%  fetal  bovine  serum  in  tris-buffered  saline-t  for  30  min. 
The  slides  were  incubated  with  primary  antibody  at  an  ambient  temperature  for  60  min  for  all 
antibodies;  the  exception  was  ERp  14C8  (ERp-2),  which  was  incubated  overnight  at  4°C, 
washed  with  phosphate-buffered  saline,  and  incubated  with  biotin-labeled  secondary  antibody 
(Envision  Dual  Link  +,  DAKO,  Carpinteria,  CA)  for  30  min.  Staining  for  the  slides  was 
developed  with  0.05%  3',  3-diaminobenzidine  tetrahydrochloride,  which  had  been  freshly 
prepared  in  0.05  mol/L  Tris  buffer  at  pH  7.6  containing  0.024%  H2O2,  and  then  the  slides  were 
counterstained  with  hematoxylin,  dehydrated,  and  mounted.  FFPE  normal  breast  tissue  was  used 
as  the  positive  control.  For  the  negative  control,  we  used  the  same  specimens  used  for  the 
positive  controls  but  replaced  the  primary  antibody  with  phosphate-buffered  saline. 

Two  observers  (M.G.R.  and  I.I.W.)  jointly  quantified  the  immunohistochemical 
expression  of  ERs  and  PR  using  light  microscopy  (magnification  20x).  Both  nuclear  and 
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cytoplasmic  expressions  were  quantified  using  a  four-value  intensity  score  (0,  1+,  2+,  and  3+) 
and  the  percentage  (0%  to  100%)  of  reactivity.  We  defined  the  intensity  categories  as  follows:  0 
=  no  appreciable  staining;  1+  =  barely  detectable  staining  in  epithelial  cells  compared  with  the 
stromal  cells;  2+  =  readily  appreciable  staining;  and,  3+  =  dark  brown  staining  of  cells.  Next,  an 
expression  score  was  obtained  by  multiplying  the  intensity  and  reactivity  extension  values 
(range,  0-300). 

EGFR  mutation  analysis.  Exons  18-21  of  EGFR  were  polymerase  chain  reaction  (PCR)- 
amplified  using  intron-based  primers  as  previously  described  (24,  25).  Approximately  200 
microdissected  FFPE  cells  were  used  for  each  PCR  amplification.  All  PCR  products  were 
directly  sequenced  using  the  Applied  Biosystems  PRISM  dye  terminator  cycle  sequencing 
method.  All  sequence  variants  were  confirmed  by  independent  PCR  amplifications  from  at  least 
two  independent  microdissections  and  DNA  extraction,  and  the  variants  were  sequenced  in  both 
directions,  as  previously  reported  (24,  25). 

Statistical  analysis.  The  immunohistochemical  expression  and  clinicopathologic  data 
data  were  summarized  using  standard  descriptive  statistics  and  frequency  tabulations.  BFiP  plots 
were  generated  to  summarize  the  distribution  of  ER  and  PR  expressions.  Associations  between 
the  marker  expression  and  patients’  clinical  and  demographical  variables  (including  age,  sex, 
smoking  history,  histology  type,  and  pathologic  stage)  were  assessed  using  appropriate  methods 
including  the  chi-square  or  Fisher  exact  test  for  categorical  variables,  and  Wilcoxon  rank  sum  or 
Kruskal-Wallis  test  for  continuous  variables.  The  Spearman  rank  correlation  coefficient  was  used 
to  estimate  the  correlation  between  immunohistochemistry  markers.  Kaplan-Meier  survival 
curves  for  patient  OS  and  RFS  were  also  generated.  The  log-rank  test  was  used  to  identify  the 
difference  between  the  patient  groups  for  both  overall  and  RFS.  For  univariate  and  multivariate 
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analyses  for  immunohistochemical  expressions,  the  Cox  proportional  hazard  model  was  used. 
Two-sided  P  values  less  than  0.05  were  considered  statistically  significant. 
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RESULTS 


Correlation  of  expression  of  ER  antibodies.  We  examined  four  commercially  available 
antibodies  against  ERa:  two  using  the  same  clone  (6F1 1)  and  two  antibodies  against 
ERp  (Supplementary  Table  2).  Using  the  scores  of  expression  generated  from  all  NSCLCs  we 
analyzed  the  correlation  of  the  expression  in  the  malignant  cells  for  the  four  ERa  and  the  two 
ERp  antibodies  tested.  All  four  of  the  ERa  antibodies  showed  nuclear  staining,  and  two  of  the 
four  antibodies  also  detected  expression  in  the  cytoplasm  of  malignant  cells  (ERa-3,  clone 
HC20,  and  ERa-4,  clone  1D5).  The  two  ERa  clone  6F1 1  antibodies  (ERa-1  and  ERa-2), 
obtained  from  two  different  companies,  demonstrated  only  nuclear  staining.  All  four  of  the 
ERa  antibodies  significantly  correlated  with  each  other  at  nuclear  expression  (Spearman  rank 
correlation:  r  =  0.32  to  0.48;  P  <  0.0001;  Supplementary  Table  3).  Also,  significant  correlation 
was  detected  in  the  staining  of  the  two  ERa  antibodies,  showing  cytoplasmic  expression  (r  = 

0.43  ;  P  <  0.0001).  There  was  no  statistically  significant  correlation  between  both  of  the  ERp 
antibodies  examined  in  their  nuclear  expression;  although  they  significantly  correlated  at  their 
cytoplasmic  expression,  the  correlation  coefficient  was  very  low  (r  =  0.17;  P  =  0.005). 

Frequency  of  ER  and  PR  expression  in  NSCLC  specimens  by  histology.  We  analyzed 
the  frequency  of  any  ER  and  PR  immunohistochemical  expression  (positive  cases,  score  >0)  for 
each  antibody  tested  by  NSCLC  tumor  histology,  and  the  data  are  summarized  in  Table  1. 
Representative  microphotographs  of  the  expression  of  ER  and  PR  with  some  of  the  antibodies 
tested  are  shown  in  Fig.  1 .  ERs  and  PR  were  detected  in  the  nucleus  of  malignant  cells  by  all  of 
the  corresponding  antibodies  tested.  However,  when  expressed,  the  percentage  of  malignant  cells 
showing  staining  was  low  in  general,  with  an  average  percentage  of  positive  expression  of:  19% 
(range  2-90%)  for  ERa-1  nuclear;  13%  (range  2-93%)  for  ERa-2  nuclear;  21%  (range  1-60%) 
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and  19%  (range  3-73%)  for  ERa-3  nuclear  and  cytoplasmic,  respectively;  and,  11%  (range  3- 
97%)  and  7%  (range  3-30%)  for  ERa-4  nuclear  and  cytoplasmic,  respectively.  The  average 
percentages  of  positive  cells  expressing  ERp  were  37  %  (range  3-90%)  and  37%  (range  3-97%) 
for  ERp-1  nuclear  and  cytoplasmic,  respectively;  and,  13  %  (range  1-77)  and  24  %  (range  3-67) 
for  ERp-2  nuclear  and  cytoplasmic,  respectively. 

Although  there  are  important  variations  in  the  frequency  of  expression  between  the 
nuclear  ERa  antibodies  tested,  adenocarcinoma  histology  showed  significantly  higher  frequency 
of  expression  than  squamous  cell  carcinomas  for  all  ERa  antibodies  (P  <0.0001-0048;  Table  1). 
For  nuclear  expression  of  ERp,  the  data  obtained  with  both  antibodies  tested  were  relatively 
consistent,  and  the  adenocarcinoma  histology  demonstrated  a  significantly  higher  frequency  of 
expression  than  the  squamous  cell  carcinoma  did  with  the  ERp-2  antibody  (P  =  0.0069).  Two  of 
the  ERa  (ERa-3  and  ERa-4)  and  both  ERp  antibodies  also  detected  ER  expression  in  the 
cytoplasm  of  NSCLC  cells  (Table  1).  While  the  ERp-2  antibody  was  expressed  in  the  cytoplasm 
of  a  subset  of  NSCLCs,  the  ERp-1  antibody  was  expressed  in  nearly  all  of  the  tumors. 
Cytoplasmic  expression — only  for  the  ERa-3  antibody — was  significantly  higher  in 
adenocarcinomas  when  compared  with  squamous  cell  carcinomas  ( P  =  0.0064). 

In  the  NSCLC  tissues,  PR  expression  was  frequently  detected  in  the  nuclei  of  malignant 
cells  only.  Squamous  cell  carcinoma  histology  showed  a  marginally  significant  higher  frequency 
of  expression  than  that  of  the  adenocarcinomas  ( P  =  0.05;  Table  1). 

Correlation  between  ER  and  PR  expression  in  NSCLC  and  patients’  clinicopathologic 
features.  We  correlated  expression  of  ERs  and  PR  for  each  antibody  tested  with  the  patients’ 
clinicopathologic  characteristics,  including  histology,  gender,  tobacco  history,  and  TNM 
pathological  stage  using  the  expression  score  as  a  continuous  variable.  Using  this  type  of 
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analysis,  adenocarcinoma  histology  also  showed  a  statistically  significant  higher  nuclear 
expression  for  all  ERa  antibodies  and  for  the  ERp-2  antibody  than  squamous  histology  (Table 
2).  Of  great  interest  was  the  fact  that  the  NSCLC  tissues  obtained  from  females  and  never 
smokers  demonstrated  statistically  significant  higher  expression  of  nuclear  ERa  and  p  for 
several  of  the  antibodies  used  (Table  2).  No  correlations  between  the  expression  of  PR  and  the 
clinicopathologic  characteristics  were  found. 

We  performed  OS  and  recurrence-free  survival  (RFS)  analyses  to  determine  the 
expression  of  ERs  and  PR  for  each  antibody  tested  by  using  specimens  from  317  patients  with 
NSCLC  with  a  median  follow-up  of  6.1  years  for  OS  and  4.2  years  for  RFS.  No  association  was 
detected  between  the  expression  of  ER  and  PR  and  OS.  Of  interest,  any  expression  of 
cytoplasmic  ERa,  using  ERa-4  antibody,  and  nuclear  ERp,  using  the  ERp-1  antibody,  conferred 
to  patients  a  significantly  worse  RFS  in  the  both  univariate  and  multivariate  analysis  (Fig.  2  and 
Table  3).  However,  only  the  cytoplasmic  expression  of  ERa-4  correlated  with  worse  RFS  when 
dichotomized  score  being  used  (HR  1.77;  95%  Cl,  1.1 1-2.81;  P  =  0.0156;  Table  3). 

Correlation  between  ER  and  PR  expression  in  NSCLC  and  tumor  EGFR  mutation 
status.  Among  182  adenocarcinoma  cases,  EGFR  mutations  of  the  tyrosine  kinase  domain 
(exons  18-21)  were  detected  in  31  (17%)  cases.  Most  (88%)  EGFR  mutations  were  detected  in 
the  exons  19  and  21,  and  we  did  not  find  correlation  between  the  location  of  the  mutation  and  ER 
a  and  p  expression.  We  correlated  the  ER  and  PR  scores  and  any  expression  (positive  cases, 
score  >0)  with  EGFR  mutation  status.  Interestingly,  EGFR  mutant  adenocarcinomas 
demonstrated  statistically  significant  higher  expression  than  wild-type  tumors  of  nuclear  ERa, 
cytoplasmic  ERa,  and  nuclear  ERp  when  tested  with  antibodies  ERa-3,  ERa-4,  and  ERP-1, 
respectively  (Table  4  and  Fig.  3).  Because  there  was  a  higher  incidence  of  EGFR  mutation  in 
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lung  adenocarcinoma  cases  from  patients  with  a  history  of  never  smoking,  Asian  ethnicity,  or 
female  characteristics  (data  not  shown),  we  adjusted  the  effects  of  age,  gender,  smoking  history, 
ethnicity,  and  pathological  stage  in  the  correlation  of  ERa  and  (3  with  EGFR  mutation  status. 
After  linear  regression  analysis,  all  the  significant  correlations  remained  statistical  significant. 
There  was  no  correlation  between  PR  expression  and  EGFR  mutation  status. 

ER  and  PR  immunohistochemical  expression  in  the  lung  respiratory  airway  adjacent 
to  adenocarcinoma  cases.  To  characterize  the  pattern  of  expression  of  ER  and  PR  in  the 
respiratory  airway  field  in  patients  with  lung  cancer,  we  selected  64  adenocarcinoma  cases  (35 
females  and  29  males;  19  never  smoked,  13  current  smokers,  and  32  former  smokers),  and  we 
studied  the  immunohistochemical  expression  of  ERa  and  p  and  PR  in  the  respiratory  cells  lining 
the  small  bronchi  (n  =  35  cases),  bronchioles  (n  =  83  cases),  and  alveoli  exhibiting  Type  II  cells 
hyperplastic  changes  (n  =  15  cases)  using  the  same  semiquantitative  scoring  system  used  in  the 
TMAs.  For  ERs,  we  tested  the  ERa-4  and  ERP-1  antibodies.  From  each  case,  we  used 
immunohistochemistry  to  examine  whole  tissue  sections  from  a  mean  of  three  different  paraffin 
blocks  (range  3-6)  containing  tumor  and  adjacent  normal  lung  tissue.  We  found  that  ERa  was 
not  expressed  in  the  airway  epithelium  adjacent  to  lung  adenocarcinomas,  including  epithelial 
samples  from  21  positive  tumors  (Supplementary  Table  4).  In  contrast,  ERp  was  widely 
expressed  in  the  cytoplasm  of  respiratory  cells:  91%  of  bronchi,  84%  of  bronchioles,  and  29%  of 
the  hyperplastic  alveoli.  ERp  nuclear  immunostaining  was  found  less  frequently:  5%  of  bronchi, 
10%  of  bronchioles,  and  none  of  the  hyperplastic  alveolar  cells.  Noticeably,  in  the  bronchial 
cells,  we  identified  two  patterns  of  cytoplasmic  immunostaining:  a  homogeneous  staining  in  all 
types  of  bronchial  cells  and  heterogeneous  staining  comprising  only  ciliated  cells  with  mainly 
supranuclear  or  apical  expression  (Fig.  1).  PR  was  found  in  the  nucleus  of  56%  of  bronchi,  61% 
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of  bronchioles,  and  33%  of  hyperplastic  alveoli.  Of  interest,  there  was  a  high  level  of  correlation 
(28/33  comparisons,  85%)  between  the  expression  of  PR  in  the  normal  epithelium  and  the 
corresponding  tumors.  Twenty  (95%)  out  of  21  cases  with  PR  positive  in  the  normal  epithelium 
were  detected  in  patients  with  tumors  that  also  expressed  this  receptor. 
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DISCUSION 


ERa  and  p  frequently  expressed  in  our  NSCLC  cases,  and  ERa  expression  distinguished 
a  subset  of  NSCLC  that  has  defined  clinicopathologic  and  genetic  features.  Although  the 
immunohistochemical  expression  of  ERa  and  p  has  been  reported  in  tumor  tissue  specimens 
from  surgically  resected  NSCLCs,  the  data  on  the  fraction  of  tumors  expressing  ER  are  still 
controversial.  Previous  studies  on  ERa  immunohistochemical  expression  in  formalin-fixed  and 
paraffin-embedded  NSCLC  specimens  using  seven  different  antibodies  identified  nuclear 
expression  in  malignant  cells  in  frequencies  that  ranged  from  none  (10,  14)  to  18%  (26)  and  38% 
(12).  Similarly,  in  other  studies,  the  frequency  of  ERa  cytoplasmic  expression  in  NSCLC  ranged 
from  0%-3%  (12,  26)  to  35%  (1 1)  and  73%  (27).  In  the  current  study,  using  four  different 
commercially  available  ERa  antibodies,  we  also  identified  a  wide  range  of  percentages  in  the 
frequency  of  NSCLCs  exhibiting  any  expression  of  ERa  in  the  nucleus  (7%-54%)  and  in  the 
cytoplasm  (0%-42%)  of  tumor  cells.  However,  in  our  study,  when  the  scores  of 
immunohistochemical  expression  were  analyzed  as  continuous  variables,  all  of  the 
ERa  antibodies  significantly  correlated  with  each  other  at  nuclear  and  cytoplasmic  locations. 

A  similar  situation  is  observed  when  the  ERp  immunohistochemical  expression  data  are 
examined  in  NSCLC.  Several  previous  studies,  using  six  different  antibodies,  have  reported 
frequencies  of  ERp  expression  in  tumors  with  a  wide  range  of  percentages  at  the  nuclear 
location — 0%  (9),  34%47%  (10,  12,  14),  and  61%-84%  (9,  11)  — but  not  in  the  cytoplasm  of 
malignant  cells,  where  most  of  the  studies  have  shown  no  reactivity  (9,  10,  12,  14);  some 
expression  was  seen  in  a  small  number  of  cases  (6)  or  low  frequency  of  expression  in  a  large 
number  of  cases  (10%)  (1 1).  In  the  present  study  using  two  antibodies,  any  ERp  nuclear 
expression  was  detected  in  about  half  (56%  and  42%)  of  the  NSCLCs,  and  cytoplasmic 
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expression  was  found  in  a  wider  range  (20%-98%)  of  our  cases.  We  do  not  have  a  definitive 
explanation  to  the  high  levels  of  expression  of  ERp  in  NSCLC  cells  in  our  study,  and  the 
discordance  with  previous  reports.  However,  immunohistochemical  analysis  has  shown  the 
distribution  of  ERp  to  be  much  more  widespread  than  ERa  (28-30).  Several  studies  have 
reported  that  ERp  immunohistochemical  expression  is  frequently  detected  in  the  nucleus  and 
cytoplasm  of  normal  respiratory  cells  (28).  While  expression  has  been  questioned  by  suggestions 
that  this  observation  is  based  on  non-specific  binding  produced  by  unpurified  antibodies  (31), 
multiple  reports  have  shown  the  presence  of  a  non-nuclear  pool  of  ERs  in  normal  and  malignant 
cells  (32-35).  Yang  et  al  (35)  used  one  of  the  same  ERp  antibodies  that  we  used  (ERp-1)  and 
demonstrated  mitochondrial  localization  of  this  receptor  in  several  normal  human  and  murine 
cells,  suggesting  a  role  for  ERp  receptor  in  the  cytoplasm  of  cells.  Our  finding  of  high  frequency 
of  ERp  expression,  using  ERb-1  antibody,  in  the  cytoplasm  of  normal  respiratory  cells  from  our 
lung  adenocarcinoma  patients  are  consistent  with  these  findings. 

Several  discrepancies  were  observed  when  we  compared  our  results  with  those  published 
previously  (6,  9,  10,  12)  using  the  same  antibodies,  especially  for  ERa.  For  example,  our  ERa-3 
antibody,  raised  against  the  COOH-terminus  region  of  the  protein,  detected  any  nuclear  and 
cytoplasmic  expressions  in  54%  and  42%  of  our  NSCLC  cases,  respectively.  Using  this 
antibody,  nuclear  expression  was  reported  in  a  small  number  of  NSCLC  tumors  by  Stabile  et  al 
(6)  and  in  none  of  the  130  tumors  examined  by  Kawai  et  al  (10).  At  the  cytoplasmic  location  of 
malignant  cells,  both  studies  reported  positive  immunostaining  (6,  10),  with  up  to  73%  of  cases 
in  the  study  performed  by  Kawai  et  al  (10). 

Why  these  inconsistent  results  on  the  immunohistochemical  expression  of  ER  a  and  p 
occur  raises  a  very  important  question.  Clearly  the  reasons  for  the  inconsistent  results  include  the 
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use  of  different  antibodies  manufactured  from  different  clones  and  by  different  companies. 
Indeed,  some  of  these  antibodies  have  been  made  against  different  parts  of  the  protein:  full 
length,  NH2-terminues,  and  COOH-terminus  regions.  It  has  been  suggested  that  several  mRNA 
splicing  variants  of  ERa  have  been  detected  in  lung  cancer  cell  lines,  and  antibodies  raised 
against  epitopes  in  the  deleted  exons  of  ER  may  give  conflicting  results  (6).  In  addition,  it  is 
important  to  note  that  there  are  multiple  criteria  reported  to  assess  ERa  and  p  positivity  in 
NSCLC  tissues.  Although  most  studies  considered  different  levels  of  intensity  (usually  a  scale  0- 
3+)  of  expression  at  nuclear  and  cytoplasmic  locations  combined  with  the  percentage  of 
malignant  cells  expressing  a  given  intensity,  the  cutoff  levels  of  expression  vary  significantly 
between  studies  (e.g.,  1+  in  >10%  of  cells;  1+  in  1%— 25%  of  cells;  >50%  of  cells;  score  “0-8”, 
etc)  (6,  9-12,  14,  26). 

Because  there  were  different  levels  of  ERa  and  P  immunohistochemical  expression 
detected  using  different  antibodies  in  ours  and  the  previous  studies  (6,  9-12,  14,  26),  we 
correlated  the  expression  of  ER  using  all  of  the  antibodies  we  tested  with  the  patients’ 
clinicopathological  features  and  the  tumors’  EGFR  mutation  status.  The  evaluation  of  multiple 
antibodies  for  ER  expression  adds  strength  to  our  findings.  In  our  study,  we  analyzed  the 
immunohistochemical  scores  as  continuous  and  dichotomized  variables,  and  a  significantly 
higher  expression  of  nuclear  ERa  was  detected  with  all  four  antibodies  tested  in  adenocarcinoma 
than  squamous  cell  carcinoma  histology — three  out  of  the  four  antibodies  tested  in  tumors 
obtained  from  females  compared  with  males  and  from  people  who  had  never  smoked  compared 
with  smokers.  The  two  previous  studies  reporting  ERa  nuclear  expression  in  NSCLC,  which 
examined  a  relatively  large  series  of  cases,  did  not  address  differences  of  expression  based  on 
histology  types  or  patients’  clinicopathologic  features  (12,  26).  In  the  NSCLC  tissues  that  we 
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reviewed,  higher  expression  of  ERp  correlated  significantly  with  tumor  adenocarcinoma 
histology  and  the  patients’  female  gender  for  ERp-1  antibody,  and  correlated  with  the  patients’ 
history  of  never  smoking  with  the  ERp-2  antibody. 

Few  studies  have  shown  inconsistent  results  on  whether  ER  expression  is  biased  to  any 
gender  using  different  types  of  specimens  and  assays  (6-9).  Schwartz  et  al  (9),  using  a  different 
antibody  than  ours,  reported  that  NSCLCs  obtained  from  females  were  46%  less  likely  to  have 
ERp-positive  tumors  than  males  in  a  multivariate  analysis.  In  addition,  mRNA  expression  of 
ERa  has  been  reported  to  be  significantly  higher  in  lung  tumors  from  women  than  from  men  (8). 
In  a  small  number  of  NSCLC  tumor  tissue  specimens,  ERa  and  p  gene  transcripts  have  been 
found  to  be  expressed  in  similar  levels  when  comparing  samples  obtained  from  females  and 
males  (7).  Adenocarcinoma  of  the  lung,  which  shows  a  weaker  association  with  tobacco  smoking 
than  with  other  types  of  lung  cancer,  is  also  found  predominantly  in  women,  suggesting  a 
possible  role  for  female  hormones  in  the  pathogenesis  of  this  type  of  lung  cancer  (5). 

In  previous  studies,  ERp  expression  in  NSCLC  tumors  has  been  associated  with 
improved  survival  (9-11),  whereas  the  immunohistochemical  expression  of  ERa  has  been  shown 
to  be  a  poor  prognostic  factor  (9).  Thus,  both  ERs  have  been  proposed  to  play  opposite  roles  in 
cell  proliferation,  with  ERa  promoting  proliferation  and  ERP  having  an  anti-proliferative  effect 
(36,  37).  In  our  study,  we  did  not  find  a  correlation  between  OS  and  RFS  and  ERp  expression, 
but  we  did  find  that  only  the  expression  of  cytoplasmic  ERa  (using  one  antibody)  conferred  to 
patients  a  significantly  worse  RFS,  but  not  OS,  in  multivariate  analysis. 

Several  studies  have  shown  that  estrogen  signaling  plays  a  role  in  the  development  of  the 
epithelium  in  the  lung  and  that  estrogen  could  potentially  promote  lung  cancer  (6,  7,  13,  16,  17). 
Additionally,  anti-estrogen  drugs  have  been  suggested  to  have  a  role  in  the  therapy  of  lung 
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cancer  (6,  19).  NSCLC  cell  lines  and  in  vivo  tumor  xenografts  have  been  shown  to  respond  to 
estrogens,  and  tumor  growth  can  be  inhibited  up  to  40%  by  the  anti-estrogen  fulvestrant  (6).  In 
the  past  few  years,  significant  advances  have  been  made  in  the  development  of  new  molecularly 
targeted  agents  for  lung  cancer  (38).  The  identification  of  the  subset  of  patients  with  NSCLC 
who  will  benefit  with  targeted  therapy  is  a  key  element  in  the  development  of  personalized 
treatment  approaches  in  this  disease.  A  pilot  study  of  combined  therapy  using  fulvestrant  and 
gefitinib  in  advanced  NSCLC  has  shown  to  be  well  tolerated  and  has  demonstrated  some  tumor 
responses  (39).  Our  study  results  strongly  suggest  that  NSCLC  tumors  obtained  from  patients 
with  adenocarcinoma  histology,  female  gender,  and  history  of  never  smoking  have  a  higher 
chance  of  expressing  ERs  and  have  the  potential  to  respond  positively  to  anti-estrogen  therapy. 

To  the  best  of  our  knowledge,  our  study  is  the  first  to  report  an  association  between 
EGFR  mutation  and  ERa  and  (3  expression  in  lung  adenocarcinomas.  Importantly,  we  have 
demonstrated  that  the  correlation  between  ER  expression  and  EGFR  mutation  is  independent  of 
the  clinicopathological  features  associated  with  both  abnormalities,  such  as  adenocarcinoma 
histology,  female  gender,  and  history  of  never  smoking  (40).  Based  on  the  interactions  between 
ER  and  EGFR-signaling  pathways,  there  is  evidence  showing  that  targeting  both  pathways  by 
using  anti-estrogens  (fulvestrant)  and  EGFR  tyrosine  kinase  inhibitors  (gefitinib),  the  antitumor 
effect  in  in  vitro  and  in  vivo  lung  cancer  models  of  the  drug  combination  is  higher  than  in 
treatment  with  each  drug  alone  (19).  Thus,  our  findings  of  an  association  between  the  activation 
of  both  pathways  further  strengthens  the  concept  of  combined  anti-estrogen  and  EGFR  inhibitor 
therapy  for  a  selected  group  of  patients  with  lung  adenocarcinoma. 

Although  PR  expression  has  been  reported  to  be  present  in  NSCLC  cell  lines  and  tumor 
specimens,  the  data  are  controversial  like  those  for  ERs  (11,  12,  15,41,  42).  Out  of  four  studies 
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reporting  on  immunohistochemical  expression  of  PR  in  surgical  resected  and  formalin- fixed 
NSCLC  tissue  specimens  using  different  antibodies,  there  were  two  studies  that  reported  a 
relatively  high  frequency  of  PR  expression  in  tumors  (39%  and  47%)  (12,  15);  the  remaining  two 
reports  showed  no  expression  (11,  42).  In  the  present  study,  PR  was  frequently  (63%)  detected 
in  the  nuclei  of  malignant  NSCLC,  with  a  trend  to  higher  expression  in  squamous  cell  carcinoma 
histology.  We  did  not  find  a  correlation  between  PR  and  any  of  the  clinicopathologic 
characteristics  we  studied,  including  survival.  In  contrast,  Ishibashi  et  al  reported  that  PR 
immunohistochemical  expression  was  higher  in  NSCLCs  obtained  from  females  and  correlated 
with  better  OS  in  stages  I— III  tumors  (12).  In  breast  cancer,  transcription  of  the  PR  gene  is  well 
known  to  be  regulated  by  estrogenic  actions  through  estrogen  receptors,  and  a  positive  PR  status 
is  generally  regarded  as  one  of  the  markers  of  functional  estrogenic  pathways.  In  our  study,  we 
found  no  statistical  correlation  between  PR  and  any  of  the  ER  antibodies  studied.  In  vitro  and  in 
vivo  studies  have  shown  that  administration  of  progesterone  inhibits  the  growth  of  PR-positive 
NSCLC  cell  lines,  which  is  similar  to  what  has  been  shown  to  happen  in  breast  and  endometrial 
carcinomas  (12). 

Lung  cancer  is  believed  to  develop  from  a  series  of  preneoplastic  lesions  in  the 
respiratory  mucosa,  and  these  abnormalities  are  frequently  extensive  and  multifocal  throughout 
the  respiratory  epithelium,  indicating  a  field-effect  or  field-cancerization  phenomenon  (43).  Our 
findings  of  relatively  frequent  expression  of  nuclear  PR  and  lack  of  expression  of  ERa  in  the 
normal  epithelium  adjacent  to  adenocarcinomas  expressing  these  receptors  suggest  that  PR,  but 
not  ERa  expression,  may  represent  a  field-effect  phenomenon.  Of  interest,  all  but  one  case  with 
normal  epithelium  expression  of  PR  showed  expression  of  this  receptor  in  the  corresponding 
tumor.  The  frequent  finding  of  cytoplasmic  ERp  in  normal  epithelium  may  represent  a 
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constitutive  expression  in  normal  respiratory  cells  and  is  probably  not  related  to  the 
carcinogenesis  process  (35). 

In  summary,  our  findings  show  that  ERa  and  (3  and  PR  are  frequently  expressed  in 
NSCLC,  and  ER  expression  distinguishes  a  subset  of  NSCLC  that  has  defined  clinicopathologic 
and  genetic  features.  The  correlation  between  ER  and  EGFR  mutation  in  lung  adenocarcinoma 
suggests  that  it  might  be  important  to  target  both  pathways  simultaneously  for  lung  cancer 
chemoprevention  and  therapy. 
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Figure  Legends 

Fig.  1.  Fig.  1.  Microphotographs  showing  representative  examples  of  immunohistochemical 
expression  of  estrogen  receptors  (ER)  a  (panels  A,  B,  and  C)  and  p  (panels  D,  E  and  F)  and 
progesterone  receptor  (PR;  panels  G,  H  and  I),  in  tissue  specimens  of  non-small  cell  lung 
carcinoma  (NSCLC)  tumors  and  bronchial  epithelium  (panels  A,  D,  and  G)  adjacent  to 
adenocarcinomas.  The  two  NSCLC  histologies  are  represented:  adenocarcinoma  (panels  B,  E 
and  H),  and  squamous  cell  carcinoma  (panels  C,  F  and  I).  ER  a  and  p  expressions  are  shown 
using  antibodies  ER-a4  and  ER-pl,  respectively.  Examples  of  nuclear  expression  and 
cytoplasmic  expressions  are  indicated  by  red  and  blue  arrows,  respectively.  Normal  epithelia 
show  nuclear  expression  of  ER-p  and  PR,  and  cytoplasmic  expression  of  ER-p. 
Adenocarcinomas  show  nuclear  expression  for  all  three  markers  and  cytoplasmic  staining  for 
ER-p.  Squamous  cell  carcinomas  show  nuclear  staining  for  PR,  and  both  nuclear  and 
cytoplasmic  for  ER-p.  The  magnification  of  the  microphotographs  is  200x  for  normal  epithelium 
and  400x  for  tumors. 

Fig.  2.  Kaplan-Meier  curves  showing  recurrence  free  survival  (RFS)  of  NSCLC  patients  for  ERa 
cytoplasmic  (panel  A)  and  ERp  nuclear  (panel  B)  expression. 

Fig.  3.  Representative  examples  of  ERa  and  p  immunohistochemical  expression  ( upper  figures ) 
and  epidermal  growth  factor  receptor  (EGFR)  mutations  ( lower  figures)  in  lung 
adenocarcinomas.  Panel  A :  ERa  (antibody  ERa-4)  positive  in  the  nucleus  of  malignant  cells  and 
sequencing  chromatograms  showing  the  presence  of  mutant  form  of  EGFR  (15  bp  deletion  in 
exon  19;  arrow  indicates  in- frame  deletion  mutation  sequence).  Panel  B :  ERp  (antibody  ERp-1) 
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positive  in  the  nucleus  of  malignant  cells  and  sequencing  chromatograms  showing  the  presence 
of  mutant  EGFR  (L858R  point  mutation  in  exon  21;  arrow  indicates  CTG  to  CGG  mutation). 
Panel  C:  ER  a  and  (1  (same  antibodies  than  panels  A  and  B )  with  negative  expression  in  the 
malignant  cells  and  sequencing  chromatograms  showing  the  presence  of  wild-type  form  for 
EGFR  exon  19  (line  indicates  sequence  746  to  750). 


27 


Table  1.  Frequency  of  ER  and  PR  immunohistochemical  expression*  in  NSCLC  tissue 
specimens 


Marker 

Location 

Adenocarcinoma 

Squamous  cell 

carcinoma 

P  value 

Number 

of  cases 

Positive 

N  (%) 

Number 

of  cases 

Positive 

N  (%) 

ERa-1 

Nucleus 

187 

20(11) 

109 

2(2) 

0.0048 

Cytoplasm 

187 

0 

108 

0 

__t 

ERa-2 

Nucleus 

186 

84  (45) 

110 

23  (21) 

<0.0001 

Cytoplasm 

185 

1(<1) 

111 

0 

1.000 

ERa-3 

Nucleus 

191 

16(8) 

114 

0 

0.0007 

Cytoplasm 

190 

92  (48) 

114 

37  (33) 

0.0064 

ERa-4 

Nucleus 

185 

74  (40) 

109 

25  (23) 

0.0028 

Cytoplasm 

185 

35  (19) 

109 

18(17) 

0.6043 

ERp-1 

Nucleus 

189 

102  (54) 

112 

66  (59) 

0.4022 

Cytoplasm 

189 

185  (98) 

112 

110(98) 

1.0000 

ERp-2 

Nucleus 

174 

83  (48) 

100 

31  (31) 

0.0069 

Cytoplasm 

172 

37  (22) 

100 

16(16) 

0.2685 

PR 

Nucleus 

177 

103  (58) 

112 

78  (70) 

0.05 

Cytoplasm 

176 

0 

112 

0 

Any  expression  score  >0  is  considered  positive. 
’'"Not  tested. 
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Abbreviations:  ER,  estrogen  receptor;  PR,  progesterone  receptor;  NSCLC,  non-small  cell  lung 
cancer;  ERa,  estrogen  receptor-a;  ER(3,  estrogen  receptor-p. 
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Table  2.  Significant  correlations  between  immunohistochemical  expression  of  ER  and  PR*  and 
NSCLC  patients’  clinicopathological  features. 


Histology 

Gender 

Tobacco  History 

ADCA  (n  =  201) 

Female  (n  =  167) 

Never  (n  =  54) 

> 

> 

> 

Estrogen  Receptor 

SCC  (n  =  116) 

Male  (n  =  150) 

Ever  (n  =  262) 

ERa-1  nucleus 

0.0048 

0.0051 

f 

ns1 

ERa-2  nucleus 

<0.0001 

0.0109 

0.0006 

ERa-3  nucleus 

0.0015 

ns 

0.0242 

ERa-4  nucleus 

0.0004 

0.0148 

0.0044 

ERp-1  nucleus 

ns 

ns 

0.0290 

ERp-2  nucleus 

0.0016 

0.044 

ns 

*ER  and  PR  were  tested  using  expression  score. 

+ns  =  not  statistically  significant. 

Abbreviations:  ER,  estrogen  receptor;  PR,  progesterone  receptor;  NSCLC,  non-small  cell  lung 
cancer;  ADCA,  adenocarcinoma  SCC,  squamous  cell  carcinoma;  ERa,  estrogen  receptor-a; 
ERp,  estrogen  receptor-p. 
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Table  3.  Multivariate  RFS  analysis  using  Cox  regression  model  in  NSCLC  patients* 


Variable 

Hazard  ratio 

95%  Cl  of  HR 

P  value 

Lower  limit 

Upper  limit 

ER  as  continuous  variable 

ERa-4  cytoplasm 

1.05 

1.01 

1.08 

0.0068 

ERP-1  nucleus 

1.01 

1.00 

1.02 

0.0034 

Stage  II  vs.  I 

1.90 

1.14 

3.18 

0.0145 

Stage  III/IV  vs.  I 

3.17 

1.98 

5.08 

<.0001 

ER  dichotomized 

ERa-4  cytoplasm:  >0  vs.  0 

1.77 

1.11 

2.81 

0.0156 

ERp-1  nucleus:  >0  vs.  0 

1.36 

0.91 

2.05 

0.1388 

Stage  II  vs.  I 

1.79 

1.08 

2.99 

0.0250 

Stage  III/IV  vs.  I 

3.13 

1.97 

4.99 

<.0001 

*With  only  significant  covariates. 

Abbreviations:  NSCLC,  non-small  cell  lung  cancer;  Cl,  confidence  interval;  HR,  hazard  ratio; 
ER,  estrogen  receptor;  ERa,  estrogen  receptor-a;  ERp,  estrogen  receptor-p. 
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Table  4.  Significant  correlations  between  immunohistochemical  expression  of  ER  and  EGFR 
mutation  status  in  adenocarcinoma 


EGFR  mutation  status 

ER  expression  by  Wild-type  Mutant 

antibody  N  positive/total  (%)  N  positive/total  (%)  P  value 


ERa-3  nucleus 

9/146  (6) 

7/28  (25) 

0.0016 

ERa-3  cytoplasm 

68/146  (47) 

21/27  (78) 

0.0029 

ERa-4  nucleus 

50/143  (35) 

18/27  (67) 

0.0020 

ERa-4  cytoplasm 

20/143  (14) 

13/27  (48) 

<0.0001 

ERp-1  nucleus 

70/145  (48) 

22/27  (82) 

0.00015 

*ER  and  PR  were  tested  using  expression  score. 

Abbreviations:  ER,  estrogen  receptor;  EGFR,  epidermal  growth  factor  receptor;  PR, 
progesterone  receptor;  ERa,  estrogen  receptor-a;  ERp,  estrogen  receptor-p. 
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Supplementary  Table  1.  Demographic,  clinicopathologic,  and  EGFR  mutation  status  data  of  the 
NSCLCs  studied 


Characteristic  Number  (%) 

Gender 
Female 
Male 

Smoking  status* 

Former 
Current 
Never 

Histology 

Adenocarcinoma 
Squamous  cell  carcinoma 

Stage 

I 

II 

III 

IV 

EGFR  mutation^ 

No  151  (83) 

Yes  31  (17) 

*Smoking  history  and  status  were  not  available  for  one  patient  with  squamous  cell  carcinoma. 
^ EGFR  mutation  data  was  available  in  182  cases. 

Abbreviations:  EGFR,  epidermal  growth  factor  receptor;  NSCLC,  non-small  cell  lung  cancer. 


202  (63) 
62  (20) 
46(15) 
7(2) 


201  (63) 
116(37) 


160  (51) 
102  (32) 
54(17) 


167  (53) 
150(47) 
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Supplementary  Table  2.  Antibodies  used  for  immunohistochemical  analysis  of  ER  and  PR  in 
NSCLC  tissue  specimens 


Receptor 

Raised  against 

Clone 

Company 

Dilution 

Code* 

ER-a 

Full  length 

6F  11 

Novocastra 

1:400 

ERa-1 

Full  length 

6F  11 

Chemicon 

1:50 

ERa-2 

C-term 

HC20 

Santa  Cruz 

1:500 

ERa-3 

N-term 

1D5 

Lab  Vision 

1:50 

ERa-4 

ER-p 

aa  1  to  150 

H150 

Santa  Cruz 

1:100 

ERp-1 

aa  1  to  153 

14C8 

Genetex 

1:100 

ERp-2 

PR 

— — : - 7TT 

Full  length 

SP2 

Lab  Vision 

1:75 

PR 

Code  utilized  to  refer  to  each  antibody  in  this  study. 

Abbreviations:  ER,  estrogen  receptor;  PR,  progesterone  receptor;  NSCLC,  non-small  cell  lung 
cancer;  ERa,  estrogen  receptor-a;  ER(3,  estrogen  receptor-(3;  aa,  aminoacid 
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Supplementary  Table  3.  Summary  of  the  Spearman’s  correlation  between  ERa  and  ERp 
antibodies  examined. 


Nucleus 

ERa-2 

ERa-3 

ERa-4 

ERa-1 

r  0.41  -P<  0.0001 

r  0.48 -P<  0.0001 

r  0.34  -P<  0.0001 

ERa-2 

— 

r  0.32  —  P<  0.0001 

r  0.44  -P<  0.0001 

ERa-3 

— 

— 

r  0.41  -P<  0.0001 

ERp-1 

ERP-2 

r  0.02  —  P  =  0.79 

Cytoplasm 

ERa-4 

ERa-3 

r  0.43  —P<  0.0001 

ERP-1 

ERp-2 

r  0.17  -P  =  0.005 

Abbreviations:  ERa,  estrogen  receptor-a;  ERp,  estrogen  receptor-p. 
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Supplementary  Table  4.  Expression  of  ER  and  PR  in  the  respiratory  epithelium  adjacent  to  lung 


adenocarcinomas 

ERa* 

ERpf 

PR 

Tissue  site 

Nuclear 

Cytoplasm 

Nuclear 

Cytoplasm 

Nuclear 

N  +  (%) 

N  +  (%) 

N  +  (%) 

N  +  (%) 

N  +  (%) 

Tumor 

21/51  (41) 

6/51  (12) 

19/47  (40) 

41/47  (87) 

27/36  (75) 

Bronchi 

0/35 

0/35 

2/37  (5) 

34/37  (91) 

18/32  (56) 

Bronchioles 

0/83 

0/83 

8/81  (10) 

68/81  (84) 

43/71  (61) 

Alveoli 

0/15 

0/15 

0/14 

4/14  (29) 

5/15  (33) 

*  Antibody  ERa-4. 

^  Antibody  ERp-1 . 

Abbreviations:  ER,  estrogen  receptor;  PR,  progesterone  receptor;  ERa,  estrogen  receptor-a; 
ER(3,  estrogen  receptor-p. 


36 


Figure  1 


ER-a 


Normal  Epithelium 


A 


ER-p 

/  / 

• -j,  llL  ♦  1  |  f  M 

D 

✓ 

jl  *  \ :  *  V  *  ~ 

j  •  •  *  *  "  ^ 

PR 

m 

V 

*  X. 

g~  -7  ; . 

.  ,/i- 

;t.v 

z  •  # 

>  >  w 

-  *  *  * 

Adenocarcinoma  Squamous  Cell  Carcinoma 


A.  ER  -a  Cytoplasmic  Expression 


•4* . +  ••• 

'•+•+■ - 1 — h  +  + . +”4"+ 

High  (>0)  (E/N  =  24/51) 


P  =  0.006 

- 1 - 1 - 1 - 1 - 1 — 

0  2  4  6  8  10 

Time  to  Recurrence  (Years) 


High  (>0)  (E/N  =  65/167) 


P=  0.051 

- 1 - 1 - 1 - 1 - 1 — 

I  2  4  6  8  10 

Time  to  Recurrence  (Years) 


Figure  3 


A.  ER  a  Positive 


Mutant 


B.  ER  (3  Positive 


C.  ER  a  and  p  Negative 


wm 

^  jl 


_ 


_ 

'  ,  } 

i  -■  ,  -Sri* 

Sv V 

•  i  **. 

t  i  r* 

%  k 

*  ^ 

V  .  V*  * 

i  /  / 

\  ” 

'  4  * 

**  **'  »  J  .  *  * 

*  V 

«  4*  *V  "  1 

■* 

1*  V*  1 

>5> 

>  i  * 

■  ■ 

/'  '•  ■'  ' 

(#fc 

A  A  G  G  A  AT  T  A  A 

G  A  G  A  A  G  C  A  A  C  A 

T/G 


746 


750 


Mutant 


Wild-type 


TTF-1  Gene  Amplification  and  Protein  Expression  Pattern  Identify 
Adenocarcinoma  of  Lung  with  Worse  Prognosis. 

Ximing  Tang1,  Menghong  Sun1,  Carmen  Behrens1,  Ludmina  Prudkin1,  Natalie  Ozbum1, 
Adi  F.  Gazdar3,  Cesar  Moran1,  Marileila  Varella-Garcia2,  and  Ignacio  I.  Wistuba1 
UT  MD  Anderson  Cancer  Center,  Houston,  TX  77030;  University  of  Colorado,  Aurora, 
CO  80011;  3UT  Southwestern  Medical  Center,  Dallas,  TX  75235 

Thyroid  transcription  factor  -1  (TTF-1),  a  lineage-specific  transcription  factor  frequently 
overexpressed  in  lung  adenocarcinoma,  has  been  recently  reported  to  show  gene 
amplification  in  a  subset  of  these  tumors.  To  better  characterize  TTF-1  copy  number  in 
non-small  cell  lung  carcinomas  (NSCLC)  and  correlate  it  with  protein  expression,  we 
studied  both  gene  copy  number  and  protein  expression  using  fluorescent  in  situ 
hybridization  (FISH)  and  immunohistochesmistry  stain  (IHC)  assays  in  a  large  series 
(N=324)  of  surgical  resected  NSCLCs  placed  in  tissue  microarrays,  including  205 
adenocarcinomas  and  119  squamous  cell  carcinomas.  We  correlated  our  findings  with 
patients’  clinico-pathologic  characteristics,  and  in  a  subset  of  adenocarcinomas  with 
EGFR  (exons  19-21)  and  KRAS  (exons  1  and  2)  mutation  status.  TTF-1  amplification 
(FISH+,  clustered  gene  signals)  was  detected  in  19%  (51  out  of  269)  of  tumors,  without 
differences  by  histology  (18%  of  squamous  cell  carcinoma  and  19%  of  adenocarcinoma). 
TTF-1  protein  high  level  expression  (IHC+,  semiquantitative  score  >200,  range  0-300) 
was  detected  exclusively  in  adenocarcinomas  (48%  of  cases),  and  in  this  tumor  type 
correlated  with  gene  amplification  (/3=0.005).  No  correlation  between  TTF1 
abnormalities  and  patients’  age,  gender,  smoking  status  and  TNM  stages  was  detected.  In 
adenocarcinomas,  IHC+,  but  not  FISH+,  correlated  with  EGFR  and  KRAS  mutations: 

IHC+ was  more  frequently  found  in  EGFR  (16/21,  76%  vs.  59/172,  34%,  P0.001)  and 
KRAS  (8/11,  72%  vs.  26/75,  34%,  iMl.016)  mutant  compared  with  wild-types  tumors. 
Survival  analysis  showed  that  for  adenocarcinoma  TTF-1  FISH+  correlated  with  worse 
recurrence-free  survival  (P=0.001),  while  IHC+ correlated  with  better  recurrence-free 
survival  (P=0.036).  Our  findings  indicate  that  TTF-1  amplification  occurs  in  a  subset  of 
NSCLCs,  including  both  major  tumor  histologies:  adenocarcinoma  and  squamous  cell 
carcinoma.  The  association  of  TTF1  expression  with  EGFR  and  KRAS  mutation  in  lung 


adenocarcinomas  may  correlate  with  the  peripheral  airway  origin  of  these  tumors.  Both 
TTF1  gene  amplification  and  protein  expression  correlate  with  NSCLC  patients’ 
prognosis.  (Supported  by  Grant  DoD-W81XWH-04-l-0142  and  W81XWH-05-2-0027). 


TITF-1  and  EGFR  gene  copy  variations  are  associated  with  prognosis  for  the 
patients  with  non-small  cell  lung  cancer 

Ximing  Tan,  Diane  Liu,  Carmen  Behrens,  Dandan  He,  Menghong  Sun,  David  Rice,  J. 
Jack  Lee,  Waun  K.  Hong,  and  Ignacio  I.  Wistuba 
UT  MD  Anderson  Cancer  Center,  Houston,  TX  77030 

Thyroid  transcription  factor  -1  (TITF-1,  a  lineage-specific  transcription  factor),  and  the 
epidermal  growth  factor  receptor  (EGFR,  a  tyrosine  kinase  membrane  receptor)  have 
shown  frequent  gene  amplification  in  non-small  cell  lung  carcinomas  (NSCLC).  We 
investigated  the  clinico-pathologic  characteristics  of  NSCLCs  having  TITF-1  andor 
EGFR  gene  copy  number  abnormalities  by  examining  gene  copy  number  status  using 
quantitative  polymerase  chain  reaction  (qPCR)  and  DNA  extracted  from  microdissected 
formalin-fixed  and  paraffin-embedded  tissue  from  53  adenocarcinomas  and  29  squamous 
cell  carcinomas.  B-actin  gene  was  used  as  reference.  In  tumors,  gene  copy  ratio  referred 
to  fi-actin  ranged  from  0.22  to  74.93  (median=1.52)  for  TITF-1,  and  0.05  to  6.28 
(median=1.51)  for  EGFR.  Ratios  1  to  2  were  defined  as  normal  gene  copy  number  (NGC). 
Ratios  <1  and  >2  were  defined  as  low  gene  copy  (LGC)  and  high  gene  copy  (HGC) 
number,  respectively.  Both,  LGC  and  HGC  categories  were  defined  as  abnormal  gene 
copy.  Similar  frequencies  of  TITF-1  and  EGFR  copy  number  categories  were  detected 
comparing  adenocarcinoma  ( TITF-1 :  LGC  15,  28%;  NGC  20,  38%;  HGC  18,  34%; 

EGFR:  LGC  9,  17%;  NGC  27,  51%;  HGC,  17,  32%)  with  squamous  cell  carcinoma 
(TITF-1:  LGC  8,  28%;  NGC  14,  48%;  HGC  7,  24%;  EGFR  :  LGC  5,  17%;  NGC  18,  62%; 
HGC  6,  21%).  We  found  a  statistically  significant  correlation  between  TITF-1  and  EGFR 
copy  numbers  (Spearman  correlation  coefficients. 36,  FS.0008).  In  both  tumor 
histologies,  neither  TITF-1  nor  EGFR  gene  copy  increase  (ratio  >2)  correlated  with 
disease  prognosis.  However,  in  adenocarcinomas,  Kaplan-Meier  and  log  rank  tests 
revealed  that  the  median  time  to  death  was  longer  in  patients  with  normal  copy  number 
compared  with  those  with  abnormal  copies  for  TITF-1  (median  4.76  years,  95%  Cl 
2.95-NA,  P  =  0.04)  and  EGFR  (4.76  years,  95%  Cl  3.13-  NA,  P=0.04).  Moreover, 
adenocarcinoma  patients  with  combined  TITF-1  and  EGFR  abnormal  copy  showed  worse 
overall  survival  (3.56  years,  95%  Cl  3.13-  NA)  compared  with  patients  with  normal  copy 


status  (median  not  reached,  .P=0.003).  In  these  patients,  multicovariate  Cox  modeling 
indicated  that  combined  copy  abnormality  of  both  genes  is  an  independent  factor  for 
worse  overall  survival  (HR  4.566,  P=0.0057).  Our  findings  suggest  that  loss  and  gain  of 
TITF1  and  EGFR  are  frequent  abnormalities  in  both  adenocarcinomas  and  squamous  cell 
carcinomas  of  the  lung,  and  in  adenocarcinoma  patients  correlate  with  disease  outcome. 
(Supported  by  Grant  DoD-W81XWH-04-l-0142  and  W81XWH-05-2-0027). 

( TITF-1  and  EGFR  gene  copies  in  lung  cancer) 


